North Sami NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-sme

Page Content

Divvun & Giellatekno - open source grammars for Sámi and other languages

North Sámi compounding

This file governs prefixing and compounding, with the following lexica and pointers. All lexica and lexicon entries are documented.

LEXICON Prefixes = lexicon for adding *eahpe and pointing to N, A, V

LEXICON R = lexicon which is pointed to from affixes files. Here the strings get flags to control compounding (@P.CmpFrst.FALSE@ etc.) and are redirected to RAlmostReal.

LEXICON RAlmostReal = lexicon pointed to from R (where flags are added) and pointing to RrealAfterCmpNFlags and (with +Cmp tag) to MiddleNouns. lexicalising the 3-part compounds, with the tag ShCmp. It has two entries:

LEXICON Rreal = This is the former R lexicon, renamed to avoid the MiddleNouns loop. The string gets flags like for R, and directed to RrealAfterCmpNFlags.

LEXICON RrealAfterCmpNFlags = This was also part of the former R lexicon, here renamed to avoid the MiddleNouns loop. Here it gets flags ensuring the result is N+N.

LEXICON RHyph = Recursive lexicon from all classes REQUIRING a hyphen to follow.

LEXICON RHyphTags = adds +Cmp/Hyph and +Cmp, and then - on lower side.

LEXICON RNum = For Num Cmp Noun, vi vil ikke ha Num Cmp Num

LEXICON Rnoun = the lexicon has two entries:

LEXICON RProp = lexicon pointed to from propernouns, and containing 3 entries

LEXICON RPropTags = A special lexicon for handling proper noun compounding without hyphens. Two entries:

LEXICON flagON-R = turns NeedsVowRed on:

LEXICON flagOFF-R = turns NeedsVowRed off:


This (part of) documentation was generated from src/fst/morphology/compounding.lexc