North Sami NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-sme

Page Content

North Saami numerals

The initial lexica

The LEXICON CmpNumeral lexicon is the entrance for compounds with numbers. Introduced to restrict such compounding to a subgroup of numerals only, mainly to exclude roman numerals, that turned out to be too problematic. With this change, roman numerals are only recognised on their own.

Arabic numerals

Arabic numeral expressions can be classified in at least the following categories:

And for sure more than these. Previously everything has been more or less lumped together, but to avoid noise and to get better input for grammar checking the ARABICS section should be rewritten such that each category gets its own lexicon. That way it is easier to restrict the syntax of numerical expressions in each category.


This (part of) documentation was generated from src/fst/morphology/stems/numerals.lexc