32 hours agoUpdated licensing information (AGPL/GPL) for files in ./florian master
Florian Breit [Wed, 23 Apr 2014 13:34:47 +0000]
Updated licensing information (AGPL/GPL) for files in ./florian

4 weeks agoEOL of files in florian/ from \r\n to \n, and removed newline at EOF.
Florian Breit [Mon, 24 Mar 2014 20:12:41 +0000]
EOL of files in florian/ from \r\n to \n, and removed newline at EOF.

6 weeks agoCurrent versions of files in florian/
Florian Breit [Sun, 9 Mar 2014 18:12:08 +0000]
Current versions of files in florian/

6 weeks ago- Fixed array indices called without quotes.
Florian Breit [Sun, 9 Mar 2014 17:51:25 +0000]
- Fixed array indices called without quotes.
- Fixed call of variables with .= operator before they are declared
- Fixed conditional assigning of indices and variables that are not declared
=> Simple do_everything.php runs now work without producing E_NOTICE and E_WARNINGs.

Some SQL Warnings remain, namely:
  Warning: Tag 'or' on line 70 looks like a set operator. Maybe you meant to do SET instead of LIST?
  Warning: Tag '+' on line 706 looks like a set operator. Maybe you meant to do SET instead of LIST?
  Warning: Tag 'or' on line 70 looks like a set operator. Maybe you meant to do SET instead of LIST?
  Warning: Tag '+' on line 706 looks like a set operator. Maybe you meant to do SET instead of LIST?
I'm not sure where they originate, so did not fix them.

2 months agoEdits to cylist
donnek [Tue, 11 Feb 2014 14:24:08 +0000]
Edits to cylist

2 months agoMerge branch 'mybranch'
donnek [Tue, 11 Feb 2014 14:20:18 +0000]
Merge branch 'mybranch'

2 months agoOne file
donnek [Tue, 11 Feb 2014 14:19:59 +0000]
One file

2 months agoVarious edits
donnek [Tue, 11 Feb 2014 14:16:12 +0000]
Various edits

8 months agoOther stuff
donnek [Sat, 3 Aug 2013 09:36:58 +0000]
Other stuff

8 months agoCognate, clause and insertion work
donnek [Sat, 3 Aug 2013 09:32:48 +0000]
Cognate, clause and insertion work

14 months agoList of files to be reglossed
donnek [Fri, 25 Jan 2013 13:36:46 +0000]
List of files to be reglossed

20 months agoClause analysis improvements. Mixed model improvements.
donnek [Sat, 28 Jul 2012 10:55:03 +0000]
Clause analysis improvements. Mixed model improvements.

20 months agoFlorian citation
donnek [Sat, 28 Jul 2012 10:10:49 +0000]
Florian citation

20 months agoDisso scripts
donnek [Fri, 27 Jul 2012 16:57:56 +0000]
Disso scripts

20 months agoTalkbank packaging
donnek [Fri, 27 Jul 2012 16:15:25 +0000]
Talkbank packaging

21 months agoImprove mixed model. Delete classes of words. Silence audiofiles with speaker-only...
donnek [Tue, 26 Jun 2012 20:50:56 +0000]
Improve mixed model. Delete classes of words. Silence audiofiles with speaker-only sound-bullets.

22 months agoMixed model analysis
donnek [Mon, 18 Jun 2012 23:06:17 +0000]
Mixed model analysis

23 months agoGenerate clause data for variation analysis. Utilities
donnek [Sat, 12 May 2012 08:41:08 +0000]
Generate clause data for variation analysis. Utilities

2 years agoClause-splitting. Collapse autogloss [or]s.
donnek [Mon, 26 Mar 2012 15:48:55 +0000]
Clause-splitting. Collapse autogloss [or]s.

2 years agoSelectively silence audiofiles. Convert files from one predominant language to another.
donnek [Thu, 2 Feb 2012 11:09:40 +0000]
Selectively silence audiofiles.  Convert files from one predominant language to another.

2 years agoTranslator format. Manual edit system.
donnek [Thu, 19 Jan 2012 08:35:30 +0000]
Translator format. Manual edit system.

2 years agoPrecode fixes, codeswitch counting refinements, read running monolingual text.
donnek [Fri, 9 Dec 2011 15:12:46 +0000]
Precode fixes, codeswitch counting refinements, read running monolingual text.

2 years agoAdditional cognates files.
donnek [Fri, 25 Nov 2011 13:43:09 +0000]
Additional cognates files.

2 years agoConcordance for words. Show diffs of autogloss.
donnek [Thu, 24 Nov 2011 23:17:34 +0000]
Concordance for words.  Show diffs of autogloss.

2 years agoNew method for tracking codeswitches.
donnek [Sun, 6 Nov 2011 08:36:49 +0000]
New method for tracking codeswitches.

2 years agoAmendments to tex, cognates, clauses. Addition of some utils.
donnek [Fri, 4 Nov 2011 09:33:05 +0000]
Amendments to tex, cognates, clauses.  Addition of some utils.

2 years agoFind/correct global typos. CSV version of trigram output. Generate lingscrb output.
donnek [Wed, 14 Sep 2011 12:01:47 +0000]
Find/correct global typos.  CSV version of trigram output. Generate lingscrb output.

2 years agoMake global changes. Add empty %eng tier. Fix import of end-terminators.
donnek [Wed, 24 Aug 2011 10:12:40 +0000]
Make global changes.  Add empty %eng tier. Fix import of end-terminators.

2 years agoGather unknowns, pick out trigrams, generate word-index
donnek [Sat, 6 Aug 2011 12:05:59 +0000]
Gather unknowns, pick out trigrams, generate word-index

2 years agoRevise preparation and import process to handle Miami corpus - easier workflow.
donnek [Thu, 14 Jul 2011 19:55:07 +0000]
Revise preparation and import process to handle Miami corpus - easier workflow.

2 years agoRecreate create_cgwords.php
donnek [Mon, 27 Jun 2011 19:47:46 +0000]
Recreate create_cgwords.php

2 years agoChanges for MC project
Kevin Donnelly [Mon, 27 Jun 2011 10:19:42 +0000]
Changes for MC project

2 years agoFile prep now fixes typos where no space precedes the period. Initial angle bracket...
donnek [Mon, 27 Jun 2011 08:24:49 +0000]
File prep now fixes typos where no space precedes the period. Initial angle bracket no longer eats pre-backtrack words when importing in MOR mode.

2 years agoRiga paper
Kevin Donnelly [Wed, 22 Jun 2011 11:18:02 +0000]
Riga paper

2 years agoRiga paper
donnek [Wed, 22 Jun 2011 08:47:45 +0000]
Riga paper

2 years agoMC queries
Kevin Donnelly [Sun, 19 Jun 2011 10:45:25 +0000]
MC queries

2 years agoBeginning morph collection
Kevin Donnelly [Sun, 19 Jun 2011 08:38:29 +0000]
Beginning morph collection

2 years agoSarah's notes
donnek [Mon, 13 Jun 2011 19:51:15 +0000]
Sarah's notes

2 years agoOslo dbs
donnek [Mon, 13 Jun 2011 16:34:50 +0000]
Oslo dbs

2 years agoRevised dictionaries
donnek [Mon, 13 Jun 2011 16:09:48 +0000]
Revised dictionaries

2 years agoRevised dictionaries
donnek [Sat, 11 Jun 2011 22:20:01 +0000]
Revised dictionaries

2 years agoMOR comparison, cognates, etc
donnek [Sat, 11 Jun 2011 22:15:13 +0000]
MOR comparison, cognates, etc

2 years agoClauses data
donnek [Tue, 24 May 2011 11:43:26 +0000]
Clauses data

2 years agoAdded GPL notice to main files.
donnek [Thu, 19 May 2011 21:50:32 +0000]
Added GPL notice to main files.

2 years agoBasic clause-splitting for Spanish and English
donnek [Thu, 19 May 2011 18:46:51 +0000]
Basic clause-splitting for Spanish and English

2 years agoFixed clar splits in cylist
donnek [Sun, 15 May 2011 15:07:17 +0000]
Fixed clar splits in cylist

2 years agoRiga presentation
Kevin Donnelly [Sun, 15 May 2011 09:45:51 +0000]
Riga presentation

2 years agocg2011 presentation
donnek [Sun, 8 May 2011 22:07:06 +0000]
cg2011 presentation

2 years agoScripts to automate converting and autoglossing an entire corpus. Scripts to automat...
donnek [Sat, 7 May 2011 09:49:10 +0000]
Scripts to automate converting and autoglossing an entire corpus.  Scripts to automate sampling and splitting.

2 years agoRevised clause-splitter to v0.3. Adjusted sequence of application to make sampling...
donnek [Thu, 5 May 2011 16:07:59 +0000]
Revised clause-splitter to v0.3.  Adjusted sequence of application to make sampling and splitting quicker.

2 years agoRevisions to clause-splitter. New file-sampling method. Revised output.
donnek [Mon, 2 May 2011 04:37:13 +0000]
Revisions to clause-splitter.  New file-sampling method.  Revised output.

3 years agoInitial work on clause-splitter for Welsh
donnek [Tue, 19 Apr 2011 09:35:01 +0000]
Initial work on clause-splitter for Welsh

3 years agoCompare auto adn human glossing. Break surface into cognate-bounded segments.
donnek [Mon, 11 Apr 2011 05:50:54 +0000]
Compare auto adn human glossing.  Break surface into cognate-bounded segments.

3 years agoChanges to allow @cym...
donnek [Sat, 26 Mar 2011 14:07:40 +0000]
Changes to allow @cym&eng items to be looked up in enlist.  Changes to global activity scripts.  Changes to allow ExPex to handle all the tiers.  Bugfix: markers for Welsh mutations not showing.
Move back to parsing of applied CG instead of reading the word id from the db.

3 years agoRevised Welsh dictionary
donnek [Tue, 15 Mar 2011 23:25:50 +0000]
Revised Welsh dictionary

3 years agoAdditional rules for Welsh
donnek [Tue, 15 Mar 2011 23:13:09 +0000]
Additional rules for Welsh

3 years agoRemoved output files
donnek [Tue, 15 Mar 2011 23:18:28 +0000]
Removed output files

3 years agoGather unknown words. Changes to Welsh autoglosser.
donnek [Thu, 24 Feb 2011 21:20:00 +0000]
Gather unknown words. Changes to Welsh autoglosser.

3 years agoStore and write cha file headers. Fix regressions in Welsh rules.
donnek [Tue, 22 Feb 2011 10:01:14 +0000]
Store and write cha file headers.  Fix regressions in Welsh rules.

3 years agoMore minor changes
donnek [Thu, 17 Feb 2011 08:17:15 +0000]
More minor changes

3 years agoMinor changes
donnek [Thu, 17 Feb 2011 08:15:27 +0000]
Minor changes

3 years agoUpdated dictionaries after POS changes.
donnek [Mon, 24 Jan 2011 09:07:57 +0000]
Updated dictionaries after POS changes.

3 years agoFurther CG changes. Enable typesetting of .cha files. Enable conversion to precode...
donnek [Mon, 24 Jan 2011 09:05:22 +0000]
Further CG changes.  Enable typesetting of .cha files.  Enable conversion to precode format.  Enable MOR/POST tagging.

3 years agoNotes
Kevin Donnelly [Wed, 12 Jan 2011 12:27:18 +0000]
Notes

3 years agoCG changes again
donnek [Wed, 12 Jan 2011 09:09:47 +0000]
CG changes again

3 years agoCG changes after the POS changes
donnek [Wed, 12 Jan 2011 09:03:50 +0000]
CG changes after the POS changes

3 years agoNew dictionary POS tags and revised Spanish grammar
donnek [Mon, 10 Jan 2011 09:40:25 +0000]
New dictionary POS tags and revised Spanish grammar

3 years agoTidying
donnek [Tue, 28 Dec 2010 13:02:26 +0000]
Tidying

3 years agoAllow use of new CLAN default scheme, and import of text glossed via MOR/POST
donnek [Tue, 28 Dec 2010 13:00:28 +0000]
Allow use of new CLAN default scheme, and import of text glossed via MOR/POST

3 years agoScripts to allow conversion to new CLAN default (with precodes)
donnek [Wed, 15 Dec 2010 23:09:12 +0000]
Scripts to allow conversion to new CLAN default (with precodes)

3 years agoCompleted changes to allow handling of new CLAN default (with precodes).
donnek [Wed, 15 Dec 2010 22:23:24 +0000]
Completed changes to allow handling of new CLAN default (with precodes).

3 years agoChanges to handle new CLAN default
Kevin Donnelly [Wed, 15 Dec 2010 11:53:21 +0000]
Changes to handle new CLAN default

3 years agoChanges to dictionaries.
donnek [Wed, 15 Dec 2010 09:25:05 +0000]
Changes to dictionaries.

3 years agoGenerating conversation profiles
donnek [Mon, 13 Dec 2010 13:01:54 +0000]
Generating conversation profiles

3 years agoAdditional CG rules.
donnek [Thu, 9 Dec 2010 22:53:10 +0000]
Additional CG rules.

3 years agoGenerate language profile for each utterance
Kevin Donnelly [Mon, 13 Dec 2010 11:41:12 +0000]
Generate language profile for each utterance

3 years agoGenerate conversation profile
Kevin Donnelly [Wed, 8 Dec 2010 11:47:50 +0000]
Generate conversation profile

3 years agoCG rules
Kevin Donnelly [Mon, 6 Dec 2010 12:31:50 +0000]
CG rules

3 years agoExpanded CG rules for Spanish and English; changes to the dictionaries
donnek [Mon, 6 Dec 2010 09:11:24 +0000]
Expanded CG rules for Spanish and English; changes to the dictionaries

3 years agoEdits of grammar rules
Kevin Donnelly [Wed, 1 Dec 2010 11:27:33 +0000]
Edits of grammar rules

3 years agoDb and file additions
donnek [Sat, 27 Nov 2010 21:22:25 +0000]
Db and file additions

3 years agoSigned-off-by: donnek <kevin@dotmon.com>
donnek [Sat, 27 Nov 2010 20:59:29 +0000]
Signed-off-by: donnek <kevin@dotmon.com>

3 years agoTools to log unknown words
Kevin Donnelly [Mon, 22 Nov 2010 12:39:38 +0000]
Tools to log unknown words

3 years agoImprovements to Spanish grammar, etc.
donnek [Mon, 22 Nov 2010 09:56:01 +0000]
Improvements to Spanish grammar, etc.

3 years agoVCS changes demo
donnek [Tue, 9 Nov 2010 11:24:34 +0000]
VCS changes demo

3 years agoChanges to allow English lookup
donnek [Tue, 9 Nov 2010 11:20:50 +0000]
Changes to allow English lookup

3 years agoVCS demo
donnek [Tue, 9 Nov 2010 11:18:26 +0000]
VCS demo

3 years agoRationalise dbs.
donnek [Mon, 1 Nov 2010 16:37:10 +0000]
Rationalise dbs.

3 years agoFixes, addition of code to handle %gra tier, revisions to CG on foot of on-the-fly...
donnek [Mon, 1 Nov 2010 16:31:13 +0000]
Fixes, addition of code to handle %gra tier, revisions to CG on foot of on-the-fly demutation

3 years agoFis for apostrophes disappearing in import; revisions to manual
Kevin Donnelly [Mon, 11 Oct 2010 10:57:50 +0000]
Fis for apostrophes disappearing in import; revisions to manual

3 years agoFixes to import process
donnek [Mon, 4 Oct 2010 21:10:01 +0000]
Fixes to import process

3 years agoDocs changes
Kevin Donnelly [Mon, 4 Oct 2010 11:21:50 +0000]
Docs changes

3 years agoRefactoring of the Welsh dictionary
donnek [Sat, 11 Sep 2010 13:17:34 +0000]
Refactoring of the Welsh dictionary

3 years agoModular lookups for the new cohort writing github/master
donnek [Tue, 7 Sep 2010 10:08:18 +0000]
Modular lookups for the new cohort writing

3 years agoChanges to how the output file is written
donnek [Tue, 7 Sep 2010 07:22:59 +0000]
Changes to how the output file is written

3 years agoOn-the-fly clitic segmentation and lookup
donnek [Mon, 30 Aug 2010 11:12:32 +0000]
On-the-fly clitic segmentation and lookup

3 years agoAutomatically create dummy cleaning functions for sub-tiers
donnek [Mon, 23 Aug 2010 08:18:02 +0000]
Automatically create dummy cleaning functions for sub-tiers

3 years agoFixes to allow import of files with MOR sub-tier; deletion of backtracking words
donnek [Sat, 21 Aug 2010 19:02:00 +0000]
Fixes to allow import of files with MOR sub-tier; deletion of backtracking words

3 years agoAdjust import to handle scanned subtiers
donnek [Fri, 20 Aug 2010 08:13:14 +0000]
Adjust import to handle scanned subtiers
  Add new function tier_fields

3 years agoCompletion of scantiers
donnek [Sun, 15 Aug 2010 13:03:44 +0000]
Completion of scantiers
Allows the utterances table to dynamically include field for each of
the subordinate tiers.