GiellaLT provides an infrastructure for rule-based language technology aimed at minority and indigenous languages, and streamlines building anything from keyboards to speech technology. Read more about Why. See also How to get started and our Privacy document.
ex.: “fálahas” is a non-normative variant of the lemma “fáláhas”, and it inflects. The normative form on the left side, and so the lemma in the analysis will be a normative form and can be found e.g. in the dictionary.
fáláhas:fáláhass JOHTOLAT ;
fáláhas+Err/Orth:fálahass JOHTOLAT ;
The descriptive FST will inflect both fálahas and fáláhas, but the line with the tag Err/Orth is removed from the normative analyser/generator during the compilation prosess.
fáláhasat
fáláhasat fáláhas+N+Pl+Nom
fálahasat
fálahasat fáláhas+Err/Orth+N+Pl+Nom
The normative analyser:
fáláhasat
fáláhasat fáláhas+N+Pl+Nom
fálahasat
fálahasat fálahasat +?
ex.: “fálahas” is a non-normative variant of the form “fáláhas”, and it does not inflect, and therefore it does not get a continuation lexicon with inflection for nouns.
The normative form on the left side, and so the lemma in the analysis will be a normative form and can be found e.g. in the dictionary.
fáláhas:fáláhass JOHTOLAT ;
fáláhas+N+Sg+Nom+Err/Orth:fálahas ENDLEX ;
Ex. brillefutterála which is a slightly adapated loanword from Norwegian to North Saami. The normative word is čalbmelássaskuohppu
brillefutterála+Err/Lex:brille#futterál SOSIAL
The descriptive FST will inflect brillefutterála, but the line with the tag Err/Lex is removed from the normative analyser/generator during the compilation prosess.
brillefutterálat
brillefutterálat brillefutterála+N+Pl+Nom
The normative analyser:
brillefutterálat
brillefutterálat brillefutterálat +?
Two lemmas, which base forms are homonyms, have different paradigms and semantics.
Example from North Saami. G3 tag for Grade 3 for consonantgradation with geminate in lemma, e.g. ss:
beassi:beassi BEARRI "reir" ;
beassi+G3:beas'si AIGI "never" ;
Analysis:
beassi
beassi beassi+N+G3+Sg+Nom
beassi beassi+N+G3+Sg+Acc
beassi beassi+N+G3+Sg+Gen
beassi beassi+N+Sg+Nom
beasi
beasi beassi+N+Sg+Gen
beasi beassi+N+Sg+Acc
Example from North Saami. NomAg tag for derivation Nomen Agent
vuovdi+NomAg:vuovdi ACTOR "salesman" ;
vuovdi:vuov'di AIGI "forest" ;
Analysis:
vuovdi
vuovdi vuovdi+N+NomAg+Sg+Nom
vuovdi vuovdi+N+NomAg+Sg+Acc
vuovdi vuovdi+N+NomAg+Sg+Gen
vuovdi vuovdi+N+Sg+Nom
vuovddi
vuovddi vuovdi+N+Sg+Gen
vuovddi vuovdi+N+Sg+Acc
Example from South Saami:
govledh+Hom1:govl TJOEHPEDH_TV "höra" ;
govledh+Hom2:govl VÅÅJNEDH "höras" ;
Analysis:
gåvla
gåvla govledh+Hom1+V+TV+Ind+Prs+Sg3
govloe
govloe govledh+Hom2+V+IV+Ind+Prs+Sg3
Orthograpic variants of the same lemma, for base form and at least parts of the inflection paradigm, should be under the same lemma. But we can add a variants tag as a help to recognize the correct base form for the paradigm.
Example from North Saami.
mandáhta+v2:mandáhtta GOAHTI-A ;
mandáhta+v1:mandáhta STAHTA ;
If the base forms are identical, but there are variants in the inflection, we don’t use these tags.