Finite state and Constraint Grammar based analysers, proofing tools and other resources
View the project on GitHub giellalt/lang-fao
Usage, in lang-fao:
cat text.txt | hfst-tokenize -cg tools/tokenisers/tokeniser-disamb-gt-desc.pmhfst | vislcg3 -g src/cg3/disambiguator.cg3
This file documents the Faroese disambiguator file.
Test: Go for minimal weight. This rule gives priority to lexicalised forms.
In most cases we first map both @CNP and @CVP, then select @CNP where it applies, and finally remove any remaining @CNP so that only @CVP is left.
CCasCNPCVP maps (@CNP @CVP) to CC
killAllahtenotCS All occurrences of “at” are CSs.
Kill Sem/ID
killAllCNP removes all remaining @CNP
XCC-CS removes CC and CS readings with no syntactic tag
ErrOrth goes for correct forms
X removes readings with no syntax
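The map, select, remove strategy for @CNP and @CVP described above can be illustrated with a small toy model. This is a sketch in Python, not CG-3, and it does not reproduce the actual rules in src/cg3/disambiguator.cg3. The `Reading` class, the helper functions, the hand-written readings and in particular the "between two nouns" selection context are all assumptions made for illustration. The sketch also assumes the usual GiellaLT reading of the tags (@CNP for a conjunction joining nominal phrases, @CVP for one joining verb phrases) and the usual Constraint Grammar behaviour that a removal never deletes the last reading of a token.

```python
from dataclasses import dataclass, replace
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Reading:
    """One analysis of a token (hypothetical toy structure)."""
    lemma: str
    tags: Tuple[str, ...]          # morphological tags, e.g. ("CC",)
    synt: Optional[str] = None     # at most one syntactic tag


def map_cc(sentence):
    """Step 1 (cf. CCasCNPCVP): give each untagged CC reading both @CNP and @CVP."""
    out = []
    for cohort in sentence:
        new = []
        for r in cohort:
            if "CC" in r.tags and r.synt is None:
                # One reading per syntactic tag.
                new += [replace(r, synt="@CNP"), replace(r, synt="@CVP")]
            else:
                new.append(r)
        out.append(new)
    return out


def has_tag(cohort, tag):
    return any(tag in r.tags for r in cohort)


def select_cnp(sentence):
    """Step 2: keep only @CNP where a context holds.

    The context used here (a noun on both sides) is invented for this sketch.
    """
    out = []
    for i, cohort in enumerate(sentence):
        between_nouns = (
            0 < i < len(sentence) - 1
            and has_tag(sentence[i - 1], "N")
            and has_tag(sentence[i + 1], "N")
        )
        if between_nouns and any(r.synt == "@CNP" for r in cohort):
            cohort = [r for r in cohort if r.synt == "@CNP"]
        out.append(cohort)
    return out


def kill_all_cnp(sentence):
    """Step 3 (cf. killAllCNP): drop leftover @CNP, but never a token's last reading."""
    out = []
    for cohort in sentence:
        kept = [r for r in cohort if r.synt != "@CNP"]
        out.append(kept if kept else cohort)
    return out


def disambiguate(sentence):
    return kill_all_cnp(select_cnp(map_cc(sentence)))


def synt_tags(sentence):
    return [[r.synt for r in cohort] for cohort in sentence]


# Hand-written toy readings, not real analyser output.
nominal = [[Reading("hundur", ("N",))], [Reading("og", ("CC",))], [Reading("ketta", ("N",))]]
verbal = [[Reading("syngja", ("V",))], [Reading("og", ("CC",))], [Reading("hon", ("Pron",))]]

print(synt_tags(disambiguate(nominal)))  # [[None], ['@CNP'], [None]]
print(synt_tags(disambiguate(verbal)))   # [[None], ['@CVP'], [None]]
```

In the first toy sentence the selection step leaves only @CNP, so the final removal has nothing else to fall back on and keeps it. In the second, no selection applies, both tags survive the second step, and the removal leaves @CVP as the default.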
This (part of) documentation was generated from src/cg3/disambiguator.cg3