Tornedalen Finnish NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-fit

Disambiguator for Meänkieli

Usage:

cat text.txt|hfst-tokenize -cg tools/tokenisers/tokeniser-disamb-gt-desc.pmhfst |vislcg3 -g src/cg3/disambiguator.cg3

This file documents the Meänkieli disambiguator file .

Delimiters, tags and sets

Sentence delimiters are the following: “<.>” “<…>” “<!>” “<?>” “<¶>”

Part-of-Speech

Numerus

Person

Cases

Types

Sets with more members

Boundaries

Verbs

Disambiguation rules

Dialects

Early rules

Possessive suffixes

First we put rules to choose Px forms… (forthcomong)

Then we remove the remaining Px

Numeral phrases

Preposition/postposition/adverb rules

Rules for mapping @CVP and @CNP on the CC and CS

Case rules

Partitive

Genitive

Illative

Number rules

More disambiguation rules

Elative

Propernouns

Verbs

Specific verbs

ei negation verb

eli

Adverbs

paljon

kerran

jälkhiin

Adjectives

toinen

Conjunctions

Subjunctions

että

jos

ko

mutta

sillä

Pronouns

sie

tet

Verb rules, Verbs

Infinitive

Present Sg3

Present Pl3 or PrsPrc

Present Pl3 or Passive

Imperative

Past tense

Prt Pl3 or Prt Sg2

Relative pronouns

HNOUN MAPPING


This (part of) documentation was generated from src/cg3/disambiguator.cg3