Finite state and Constraint Grammar based analysers, proofing tools and other resources
The essence of testing: With testing, we want to check 3 things:
With `make` you check that there are no technical issues. Read the report and look for messages such as:
ERROR: Sublexicon is mentioned but not defined. (PYERRI)
ERROR: Sublexicon is mentioned but not defined. (muvgâ PUIGA)
ERROR: Compilation aborted because of 1 parse error. line 11634: syntax error at "suoraan"
Segmentation fault: 11, Error 139
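One way to avoid missing such messages in a long report is to save the build output and search it. The sketch below assumes the report was saved with `make 2>&1 | tee make.log`; the function name `scan_build_log` and the file name `make.log` are hypothetical, and the search pattern is based only on the example messages above, so it will not catch every kind of failure.

```shell
# Hypothetical helper: list suspicious lines in a saved build log.
# The pattern covers only the message types shown above.
scan_build_log() {
    # -n prefixes each hit with its line number in the log,
    # -E enables the | alternation in the pattern.
    grep -n -E 'ERROR|syntax error|Segmentation fault' "$1"
}
```

Usage, under the same assumptions: `make 2>&1 | tee make.log`, then `scan_build_log make.log`. No output means none of the patterns was found, which is not a guarantee that the build succeeded.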
With `make check` you also check the morphology:
Test only yaml-tests with this command:
sh test/yaml-check.sh
Test only the generation of nouns, adjectives, verbs and proper nouns (no yaml-tests) with this command:
sh test/check_lemmas.sh
Analyse the forms:
`usmn` and `usmnNorm` analyse a word form, e.g. nieidáin
If you don’t get any analysis, only `?`, then you should generate the word:
`dsmn` and `dsmnNorm` generate the forms, e.g. nieidâ+N+Sg+Com
Not quite sure that you have a new analyser and generator? Check the date and time when your analyser/generator was compiled:
`ll src/`
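In many shell setups `ll` is an alias for `ls -l`, which lists modification times. To compare timestamps without reading them by eye, here is a minimal sketch using the shell's `-nt` ("newer than") file test, which is widely supported (bash, zsh, dash). The function name `needs_rebuild` is hypothetical, and the path of the compiled file is passed in by the caller because its name is not given here.

```shell
# Hypothetical helper: succeed if the compiled file ($1) is missing
# or older than the source file ($2).
needs_rebuild() {
    [ ! -e "$1" ] || [ "$2" -nt "$1" ]
}
```

For example, with `$compiled` set to whichever file under `src/` you want to check: `needs_rebuild "$compiled" src/fst/stems/nouns.lexc && echo "stems changed since the last compile"`.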
Quick commands, to be run from `langs/smn`:
sh devtools/noun_minip.sh nieidâ
sh devtools/adj_minip.sh uánehâš
sh devtools/prop_minip.sh Aanaar
Get only the correct lemma and not compounds:
sh devtools/noun_minip.sh '^nieidâ[:+]'
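The effect of the anchored pattern can be tried out with plain `grep`. This is only a sketch: it assumes that the script passes its argument on as a regular expression, which is not confirmed here, and the sample entries (stems, lexicon name and the two compound lemmas) are invented for illustration.

```shell
# Invented lexc-style entries; stems and lexicon names are placeholders.
sample_entries() {
    printf '%s\n' \
        'nieidâ+N:stem1 SOMELEX ;' \
        'nieidâ:stem2 SOMELEX ;' \
        'nieidâkuávlu+N:stem3 SOMELEX ;' \
        'pärninieidâ+N:stem4 SOMELEX ;'
}

# Bare lemma: matches anywhere in the line, so compounds are counted too.
sample_entries | grep -c 'nieidâ'          # prints 4

# Anchored at line start and followed by ':' or '+':
# only the entries for the lemma itself are counted.
sample_entries | grep -c '^nieidâ[:+]'     # prints 2
```

The `^` excludes compounds that end in the lemma, and the `[:+]` excludes compounds that begin with it.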
Look at all lemmas going to the same continuation lexicon:
sh devtools/noun_minip.sh PIIVTAS | less
sh devtools/adj_minip.sh KOOIDAS | less
Look at all forms:
grep '¢' src/fst/stems/nouns.lexc | cut -d '¢' -f2 | cut -d '!' -f1 | preprocess | grep '[a-z]' | usmnNorm | less
Get the forms which are not recognized by the analyser:
grep '¢' src/fst/stems/nouns.lexc | cut -d '¢' -f2 | cut -d '!' -f1 | preprocess | grep '[a-z]' | usmnNorm | grep '\?' | less
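The unrecognized forms often appear more than once in such output. A minimal sketch for reducing the analyser output to a unique list of failing word forms follows. It assumes that the analyser prints tab-separated lines with the input form in the first column and marks unknown words with `+?`; the function name `unrecognised_forms` is hypothetical.

```shell
# Hypothetical helper: read analyser output on stdin and print each
# unrecognised input form once.
# Assumption: tab-separated lines, unknown words marked with "+?".
unrecognised_forms() {
    # index() does a literal substring search, so "+?" needs no escaping.
    awk -F '\t' 'index($0, "+?") { print $1 }' | sort -u
}
```

Under those assumptions it can replace the final `grep '\?'` stage of the pipeline above: `... | usmnNorm | unrecognised_forms | less`.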