Skolt Sami NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-sms

Skolt Sámi disambiguator

Note: This documentation file is still work-in-progress, and should not yet be used. Read the source file instead.

Delimiters

DELIMITERS = “<.>” “<!>” “<?>” “<…>” “<¶>”;

Tags and sets #

We declare BOS, EOS and all the tags from the fst.

Disambiguation

Cycle 0, rules without context

Possessive suffix

Probably exists only for Refl and for kinship terms In Skolt Sami Possessive suffixes ARE USED Jaska 2020-11-08

Pronouns and nouns

Postpostions

Short Pronouns

No rules.

Proper nouns

Cycle 1

Numerals

Trivialia

Nouns

Nominative plural

Genitive

Verbs

Imperative

There can be Interj, VOC,

Genitive modifier

Subject

M A P P I N G

CC- and CS-Mapping

CASES

PrfPrc

Person

Nomen

Verb or Noun

Dem

No rules

CC and CS or Adv

Adj or Adv

grammatisk ord eller N eller A

N or Adj

N or V

Ger or Der/NomAct

Adj or Indef

Num

Rel or Interr

Interj

no rule

Po or Pr

Adv or Po/Pr

Com

Accusative or illative

Accusative or Genitive

Indef or Adv

special lemmas

no rules

Verb person vs. Inf – moved here in order to have the pronouns disambiguated first.

Proper nouns

Rule set taken from sme

Substituting Prop tags

Prop or not

Removing proper nouns that are lookalikes

Particular proper nouns

Todo: sms-ify.

Mapping rules

SAFE RULES

subject rules and spred rules

Removing Err/Orth

Denne regelen fjerner Err/Orth når det er samme lemma, sjøl om morfologien er forskjellig.


This (part of) documentation was generated from src/cg3/disambiguator.cg3