Mansi NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-mns

mns meeting 27.6. 2023

Present: Csilla, Jack, Trond.

Agenda

Status

for the work

Coverage:

for the fst

Problems with verbs

Verbs and nouns are quite alike, but we look at different forms. The verbal stem is close to the noun lemma. Trond has check

Priorities onward

Linguistic:

Summer

Csilla will work, and consult Trond or Jack whenever running into trouble.

working with missing lists

Two ways of working:

  1. Csilla adds missing words to fst and typos to typos.txt
  2. Csilla uses the file.missing.freq.date file and writes comments to each word, one of the following:
    • fixed (or: no comment means “Csilla fixed it”)
    • correct # this means: must be fixed in fst
    • typo > correctform # means: Csilla adds it to typos.txt

Trond to make sure the missing lists are run without long vowel problems.

Working with Yamls

Csilla to make sure they are ok. Lemmas should be tested only once.

Next meeting

We will look for a date it in a month or so (perhaps less).