Finite state and Constraint Grammar based analysers, proofing tools and other resources
View the project on GitHub giellalt/lang-olo
All doc-comment documentation in one large file.
This dep file is for sma, sme, smj, sje.
Sentence delimiters are the following: <.> <!> <?> <…> <¶>
N V A Adv CC CS Inf Sup Neg Num Po Pr
Pcle Prop
Pron IV TV COMMA DASH CITATION to keep colouring we add a “ HYPHEN QMARK PUNCT LEFT RIGHT CLB Ind Pot Impr ImprtII Cond ConNeg Caus causative eus VGen Interj ABBR ACR Prs Prt Cmpnd RCmpnd PrfPrc PrsPrc Actor Actio Ger Indef Nom Acc Ill Com Gen Ess
IM For fao
Correction rules
muitalit
XX
XX
XX
faoSumId=Rel
lgRemove removes the language tags
This (part of) documentation was generated from src/cg3/dependency.cg3
Disambiguator for Olonets
Sentence delimiters are the following: “<.>” “<…>” “<!>” “<?>” “<¶>”
Indef = Indef pron
Neg = Negation verb
COMMA = comma
WORD = all PoS
NPMODADV = NPMOD plus adverb
NOT-NPMOD = these cannot modify a noun
NOT-NPMODADV = these cannot modify a noun, and is not adverb
Boundaries
Verbs
person_test selects finite verb if there is a Pron Pers to the left
adv_after_V selects adverb if there is a verb to the right
prop_infrontof_kieli removes propernoun in fron of kieli, if it kan be something else, e.g. Kainun kieli
PropInit removes propernoun in the beginning of a sentence if it kan be a CC or a Pr (e.g. Mutta)
PropNotInit selects propernoun if it is not in the beginning of a sentence
Possessive suffixes
Numeral phrases
Prifgenpar selects preposition to the left of Gen or Par
Poifgenpar selects postposition to the right of Gen or Par
vasthaan
CVP maps @CVP to CS and mutta
CNPifN maps @CNP to CC between two N
CNPifInf maps @CNP to CC between two Inf
Genitive
ei negation verb
eli
Conjunctions
että
jos
ko
sillä
Imperative
Relative pronouns
Pl3ollaifplrelpronandplinterrpron selects Pl3 if olla
Sg3ollaifplrelpronandplinterrpron selects Sg3 if olla
Sg3ollainpretandperf selects Sg3 if COPULAS
Sg3ollainpretandperf selects Sg3 if COPULAS
Relpronandnotintterpron selects Rel Sg if Interr
Relpronandnotintterpron selects Rel Sg if Interr
interrpron selects Interr if ? in the end
DifferenceBetweenNiitäImprtAndNiitäDemAndPersIfSubj selects Pron Dem Pl or Pron Pers Pl3 when finite verb to the right
paljonadvandnotpaljonoun selects Adv if paljon
Relpronifitsanounoracommabeforeit selects Rel Pl if N to the left
annaimperativeandnotannaname removes Prop if Anna se
tulinounfromtuliprtsg3 selects V Sg
dempronandnotpronpers selects Den if A of N to the right
Imperativefromconneg selects and removes ConNeg
ImperativeafterNeg removes Imprt if pronoun
interrel selects Interr of Rel if CS to the right
+FMAINV to the remaining finite verbs which are not AUX
@<ADVLcoor (@<ADVL) for ADVLCASEAdv if @CNP to the left and ADVL to the left of it
X maps X everywhere
REMOVE X removes X whenever there is any other tag.
WORDLEMMA = regex giving the lemma in question
errorth removes Err/Orth if there is an analysis without Err/Orth with the same lemma
This (part of) documentation was generated from src/cg3/disambiguator.cg3
Sets for POS sub-categories
Sets for Semantic tags
Sets for Morphosyntactic properties
Sets for verbs
V is all readings with a V tag in them, REAL-V should
be the ones without an N tag following the V.
The REAL-V set thus awaits a fix to the preprocess V … N bug.
The set COPULAS is for predicative constructions
NP sets defined according to their morphosyntactic features
The PRE-NP-HEAD family of sets
These sets model noun phrases (NPs). The idea is to first define whatever can occur in front of the head of the NP, and thereafter negate that with the expression WORD - premodifiers.
The set NOT-NPMOD is used to find barriers between NPs. Typical usage: … (*1 N BARRIER NPT-NPMOD) … meaning: Scan to the first noun, ignoring anything that can be part of the noun phrase of that noun (i.e., “scan to the next NP head”)
Miscellaneous sets
Border sets and their complements
Syntactic sets
These were the set types.
hab1
hab2
hab3 (
habNomLeft
hab4
hab6
hab7
hab5 This is not HAB
habDain (
habGen (
spred<obj (@SPRED<OBJ) for Acc; the object of an SPRPED. Not to be mistaken with OPRED. If SPRED is to the left, and copulas is to the left of it. Nom or Hab are found sentence initially.
Hab<spred (@<SPRED) for Nom; if copulas, goallut or jápmit is FMAINV and habitive or human Loc is found to the left. OR: if Ill or @Pron< followed by HAB are found to the left.
Hab>Advlcase<spred (
Nom>Advlcase<spred (
<spred (
<spred (
<spredQst1 (
<spredQst2 (@<SPRED) for Nom; in a typically question sentence; differs from <spredQst1 by not beeing as restricted to the right. Though you are not allowed to be Pers or human.
Nom<spredQst (@<SPRED) for Nom; in a typically question sentence. Differs from <spredQst2 by letting Nom be found between SPRED and copulas
<spred (@<SPRED) for A Nom or N Nom if; the subject Nom is on the same side of copulas as you: on the right side of copulas
<spredVeara (@<SPRED) for veara + Nom; if genitive immediately to the right, and intransitive mainverb to the right of genitive
leftCop<spred (@<SPRED) for Nom; if copulas is the main verb to the left, and there is no Ess found to the left of cop (note that Loc is allowed between target and cop). OR: if you are Coll or Sem/Group with copulas to your left.
<spredLocEXPERIMENT (@<SPRED) for material Loc; if you are to the right of copulas, and the Nom to the left of copulas is not a hab-actor
NumTime (@<SPRED) for A Nom
<spredSg (@<SPRED) for Sg Nom
<spredPg (@<SPRED) for Pl Nom
<spred (@<SPRED) for Nom; if copulas to the left, and Nom or sentence boundary to the left of copulas. First one to the right is EOS.
<spred (@<SPRED) for N Ess
spredEss> (@SPRED>) for N Ess; if copulas to the right of you, and if an NP with nom-case first one to your left.
HABSpredSg> (@SPRED>) for Nom; if habitive first one to the left, followed by copulas.
GalleSpred> (@SPRED>) for Num Nom; if sentence initial
spredSgMII> (@SPRED>)
r492> (@SPRED>) for Interr Gen; consisting only of negations. You are not allowed to be MII. You are not allowed to have an adjective or noun to yor right. You are not allowed to have a verb to your right; the exception beeing an aux.
AdjSpredSg> (@SPRED>) for A Sg Nom; if copulas to the right, but not if A or @<SPRED are found to the right of copulas
SpredSg>Hab (@SPRED>) for Nom; if you are sentence initial, copulas is located to the right, and there is a habitive to the right of copulas
Spred>SubjInf (@SPRED>) for Nom; if copulas to the right, and the subject of copulas is an Inf to the right
spredCoord (@<SPRED) coordination for Nom; only if there already is a SPRED to the left of CNP. Not if there is some kind of comparison involved.
subj>Sgnr1 (@SUBJ>) for Nom Sg, including Indef Nom if; VFIN + Sg3 or Pl3 to the right (VFIN not allowed to the left)
subj>Pl (@SUBJ>) for plural nominatives, including Coll and Sem/Group. VFIN + Pl3 to the right.
subj>Pl (@SUBJ>) for plural nominatives
subj>Sgnr2 (@SUBJ>) for Nom Sg; if VFIN + Sg3 to the right.
<subjSg (@<SUBJ) for Nom Sg; if VFIN Sg3 or Du2 to the left (no HAB allowed to the left).
f<advl (@-F<ADVL) for infinite adverbials
f<advl (@-F<ADVL) for infinite adverbials
s-boundary=advl> (@ADVL>) for ADVL that resemble s-booundaries. Mainverb to the right.
-fobj> (@-FOBJ>) for Acc
-fobj> (@-FOBJ>) for Acc
advl>mainV (@ADVL>) if; finite mainverb not found to the left, but the finite mainverb is found to the right.
<advl (@<ADVL) if; finite mainverb found to the left. Not if a comma is found immediately to the left and a finite mainverb is located somewhere to the right of this comma.
advlPoPr> (@<ADVL) if mainverb to the right.
advlEss> (@<ADVL) for weather and time Ess, if FMAINV to the left.
advl>inbetween (@ADVL>) for Adv; if inbetween two sentenceboundaries where no mainverb is present.
comma<advlEOS (@<ADVL) if; comma found to the left and the finite mainverb to the left of comma. To the right is the end of the sentence.
advlBOS> (@ADVL>) if; you are N Ill and found sentnece initially. First one to your right is a clause.
<advlPoEOS (@<ADVL) for Po; if you are found at the very end of a sentence. A mainverb is needed to the right though.
cleanupILL<advl (@<ADVL) for N Ill if; there are no boundarysymbols to your left, if you arent already @N< OR @APP-N<, and no mainverb is to yor left.
This (part of) documentation was generated from src/cg3/functions.cg3
LEXICON ab-noun
LEXICON ab-adj
LEXICON ab-adv
LEXICON ab-num
LEXICON ab-nodot-noun The bulk
LEXICON ab-nodot-adj
LEXICON ab-nodot-adv
LEXICON ab-nodot-num
LEXICON ab-dot-noun This is the lexicon for abbrs that must have a period.
LEXICON ab-dot-adj This is the lexicon for abbrs that must have a period.
LEXICON ab-dot-adv This is the lexicon for abbrs that must have a period.
LEXICON ab-dot-num This is the lexicon for abbrs that must have a period.
LEXICON ab-dot-cc
LEXICON ab-dot-verb
LEXICON ab-nodot-verb
LEXICON ab-dot-IVprfprc
LEXICON nodot-attrnomaccgen-infl
LEXICON nodot-attr-infl
LEXICON nodot-nomaccgen-infl
LEXICON dot-attrnomaccgen-infl
LEXICON dot-attr
LEXICON dot-nomaccgen-infl
LEXICON DOT - Adds the dot to dotted abbreviations.
This (part of) documentation was generated from src/fst/morphology/affixes/abbreviations.lexc
LEXICON Q_X
LEXICON A_
LEXICON A_TULLUT IS THIS LIVVI? tullut:tullu
LEXICON A_EX needs to be changed to @exclude=fst
LEXICON A_BAHUV
LEXICON A_UNDECL undeclinable fin: taipumaton
LEXICON A_IS-N-PL-GEN genitive plural attributes
LEXICON A_IS-N-SG-GEN genitive plural attributes
LEXICON A-DEM_NÄMÄ nämä:nämä
LEXICON A_KUU kuu:kuu
LEXICON A_MUA mua:maa
LEXICON A_PIÄ piä:piä
LEXICON A_HYVÄ hyvä:hyvä the comparatives and superlatives are supletive
LEXICON A_OZA oza:oza
LEXICON A_SÄYNÄ säynä:säynä
LEXICON A_KALA kala:kala
LEXICON A_KOIVU koivu:koivu
LEXICON A_HERY hery:hery
LEXICON A_LUGU lugu:lugu
LEXICON A_IDY idy:idy
LEXICON A_HÄKKI häkki:häkki
LEXICON A_ARTELI
LEXICON A_ALUS alus:aluks
LEXICON A_KUURNIS kuurnis:kuurne
LEXICON A_PUHTAHUS puhtahus:puhtahu
LEXICON A_OLUT olut:olu
LEXICON A_PÄIVY päivy:päivä
LEXICON A_MUARJU muarju:muarja
LEXICON A_AKKU akku:akka
LEXICON A_VALGEI valgei:valge
LEXICON A_RAHMANNOI rahmannoi:rahmannoi
LEXICON A_PAGIZII pagizii:pagizi
LEXICON A_KESTÄY kestäy:kestä
LEXICON A_PAREMBI parembi:paremb
LEXICON A_JIÄTÖI jiätöi:jiät LEXICON A_HUOLETOI huoletoi:huolet LEXICON A_HUOLETOI/JIÄTÖI huoletoi:huolet
LEXICON A_KARJALAINE karjalaine:karjala
LEXICON A_LIYGILÄINE liygiläine:liygilä
LEXICON A_NAINE naine:nai LEXICON A_KIELINE kieline:kieli LEXICON A_NAINE/KIELINE_01 kieline:kieli naine:nai
LEXICON A_TOINE toine:to LEXICON A_TOINE-PL toine:to
LEXICON A_MADAL madal:madal
LEXICON A_PIIRAI piirai:piira
LEXICON A_RAIŠ raiš:ra
LEXICON A_PEREH pereh:pereh
LEXICON A_TULLUH tulluh:tullu
LEXICON A_PESSYH pessyh:pessy
LEXICON A_ARMAS armas:arma
LEXICON A_VARVAS varvas:varva
LEXICON A_TUORES tuores:tuore
LEXICON A_SUARI suari:suar
LEXICON A_KIELI kieli:kiel LEXICON A_SUARI/KIELI_01 suari:suar
LEXICON A_VUOZI vuozi:vuod
LEXICON A_VEZI vezi:ved
LEXICON A_NIMI nimi:nim
LEXICON A_JÄLGI jälgi:jälg front vowel gradation Yes
LEXICON A_VAŽEN važen:važe
LEXICON A_LÄMMIN lämmin:lämbi
LEXICON A_TAIGIN taigin:taigin
LEXICON A_SALBOIN salboin:salboi
LEXICON A_ENIN enin:eni
These cases are symmetrically marked for number The next two share the same stem vowel
This (part of) documentation was generated from src/fst/morphology/affixes/adjectives.lexc
Adverbs Olonets-Karelian adverbs compare.
LEXICON ADV-IS-ELA-WITH-PXSG3 e.g. levälleh
This (part of) documentation was generated from src/fst/morphology/affixes/adverbs.lexc
Clitics Livvi clitics
This (part of) documentation was generated from src/fst/morphology/affixes/clitics.lexc
Livvi nouns inflect in cases. Vowel harmony involves front and back Gradation does not affect all consonants, therefore there are three values: Yes, No and NA (not applicable)
LEXICON N_SUO suo:suo Gradation: No Harmony: Back
LEXICON N_VYÖ vyö:vyö Gradation: No Harmony: Front
LEXICON N_KUU kuu:kuu Gradation: No Harmony: Back
LEXICON N_PII pii:pii Gradation: No Harmony: front
LEXICON N_MUA mua:mua Gradation: No Harmony: Back
LEXICON N_PIÄ piä:piä Gradation: No Harmony: Front
LEXICON N_PAPPI pappi:pappi Gradation: Yes Harmony: Back stem final i is retained
LEXICON N_HÄKKI häkki:häkki Gradation: Yes Harmony: Front stem final i is retained
LEXICON N_LEIRI leiri:leiri Gradation NA Harmony: Front stem final i is retained
LEXICON N_PADA pada:pada Gradation Yes Harmony: Back stem final a changes to u in Sg Par stem final a changes to o before i in Pl stem
LEXICON N_KALA kala:kala Gradation NA Harmony: Back stem final a changes to u in Sg Par stem final a changes to o before i in Pl stem
LEXICON N_OZA oza:oza Gradation NA Harmony: Back stem final a changes to u in Sg Par stem final a changes to 0 before i in Pl stem
LEXICON N_SÄYNÄ säynä:säynä Gradation NA Harmony: Front stem final ä changes to i in Sg Par stem final ä changes to 0 before i in Pl stem
LEXICON N_KOIVU koivu:koivu Gradation NA Harmony: Back There are no changes in stem-final u Plural stem in loi
LEXICON N_HERY hery:hery Gradation NA Harmony: Front There are no changes in stem-final y Plural stem in löi
LEXICON N_IDY idy:idy Gradation Yes Harmony: Front There are no changes in stem-final y Plural stem in löi
LEXICON N_LUGU lugu:lugu Gradation Yes Harmony: Back There are no changes in stem-final u Plural stem in loi
LEXICON N_RUNO runo:runo Gradation NA Harmony: Back Stem-fianl o changes to u in Sg Par Plural stem in loi
LEXICON N_HÖRÖ hörö:hörö Gradation NA Harmony: Front
LEXICON N_RUADO ruado:ruado Gradation Yes Harmony: Back
LEXICON N_KYNDÖ kyndö:kyndö Gradation Yes Harmony: Front
LEXICON N_JÄLGI jälgi:jälg Gradation Yes Harmony: Front
LEXICON N_JOGI jogi:jog Gradation Yes Harmony: Back
LEXICON N_MUAMO muamo:muama Gradation NA Harmony: Back
LEXICON N_TUATTO tuatto:tuatta Gradation Yes Harmony: Back
LEXICON N_DIÄDÖ diädö:diädä Gradation No Harmony: Front
LEXICON N_MUARJU muarju:muarja Gradation No Harmony: Back two forms for accusative two forms for elative, ablative phrases
muarju+N+Sg+Nom
muarju+N+Sg+Gen
muarju+N+Sg+Par
muarju+N+Sg+Tra
muarju+N+Sg+Abe
muarju+N+Sg+Com
muarju+N+Sg+Ine
muarju+N+Sg+Ela
muarju+N+Sg+Ill
muarju+N+Sg+Ade
muarju+N+Sg+Abl
muarju+N+Sg+All
muarju+N+Sg+Ess
muarju+N+Sg+Ter
muarju+N+Sg+Apr
muarju+N+Pl+Nom
muarju+N+Pl+Gen
muarju+N+Pl+Par
muarju+N+Pl+Tra
muarju+N+Pl+Abe
muarju+N+Pl+Com
muarju+N+Pl+Ine
muarju+N+Pl+Ela
muarju+N+Pl+Ill
muarju+N+Pl+Ade
muarju+N+Pl+Abl
muarju+N+Pl+All
muarju+N+Pl+Ess
muarju+N+Pl+Ins
muarju+N+Pl+Apr
LEXICON N_PIÄSTÄNDY piäständy:piäständä Gradation No Harmony: Front
LEXICON N_SUAJU suaju:suaja Gradation No Harmony: Back
LEXICON N_AKKU akku:akka Gradation Yes Harmony: Back
LEXICON N_KNIIGU kniigu:kniiga Gradation No Harmony: Back
LEXICON N_SULGU sulgu:sulga Gradation Yes Harmony: Back
LEXICON N_KOIRU koiru:koira Gradation NA Harmony: Back
LEXICON N_NIMI nimi:nim Gradation NA Harmony: Front
LEXICON N_HANGI hangi:hang Gradation NA Harmony: Back
LEXICON N_PÄIVY päivy:päivä Gradation NA Harmony: Front
LEXICON N_MEČČY meččy:meččä Gradation Yes Harmony: Front
LEXICON N_IŽÄNDY ižändy:ižändä Gradation No Harmony: Front
LEXICON N_LATE late:latte Gradation Yes Harmony: Back
LEXICON N_SIVE sive:side Gradation Yes Harmony: Front
LEXICON N_HARDIE hardie:hardie Gradation NA Harmony: Back
LEXICON N_KONDII kondii:kondi Gradation NA Harmony: Back
LEXICON N_STIPENDII stipendii:stipendi Gradation NA Harmony: Back
LEXICON N_REBOI reboi:reboi Gradation No Harmony: Back
LEXICON N_JÄNÖI jänöi:jänöi Gradation No Harmony: Back
LEXICON N_PÖČÖI pöčöi:pöččö Gradation Yes Harmony: Front
LEXICON N_VALGEI valgei:valge Gradation NA Harmony: Back
LEXICON N_LIBEI libei:libe Gradation NA Harmony: Back
LEXICON N_OSTAI ostai:osta Gradation NA Harmony: Back
LEXICON N_PEZII pezii:pezi Gradation NA Harmony: Front
LEXICON N_KESTÄY kestäy:kestä Gradation NA Harmony: Front
LEXICON N_UKSI uksi:uks Gradation NA Harmony: Back
LEXICON N_SUARI suari:suar Harmony: Back LEXICON N_SUARI-PL suari:suar Harmony: Back
LEXICON N_SUARI/KIELI_01 kieli:kiel Gradation No
LEXICON N_KIELI kieli:kiel Harmony: Front
LEXICON N_KIELI-SG kieli:kiel LEXICON N_KIELI-PL kieli:kiel Harmony: Front
LEXICON N_LAPSI lapsi:laps lapsi:laps Gradation NA Harmony: Back
LEXICON N_VEZI vezi:ved Gradation NA Harmony: Front
LEXICON N_SUZI suzi:su Gradation NA Harmony: Back
LEXICON N_VUOZI vuozi:vuod Gradation NA Harmony: Back
LEXICON N_SYGYZY sygyzy:sygyzy Gradation NA Harmony: Front
LEXICON N_VASKIČČU vaskičču:vaskičča Gradation Yes Harmony: Back
LEXICON N_KARJAL karjal:karjal Gradation NA Harmony: Back LEXICON N_KARJAL-SG karjal:karjal LEXICON N_KARJAL-PL karjal:karjal
LEXICON N_MADAL madal:madal Gradation No Harmony: Back LEXICON N_MADAL-SG madal:madal LEXICON N_MADAL-PL madal:madal
LEXICON N_PIIRAI piirai:piira CHECK THIS Gradation NA Harmony: Back
LEXICON N_VERÄI veräi:verä CHECK THIS Gradation NA Harmony: Back
LEXICON N_KANDAI kandai:kanda Gradation NA Harmony: Back
LEXICON N_AVUAJU Gradation NA Harmony: Back
LEXICON N_KERIÄJY piästäjy:piästä Gradation NA Harmony: Front
LEXICON N_PAGIZII pagizii:pagizi Gradation NA Harmony: Back LEXICON N_PAGIZII-SG LEXICON N_PAGIZII-PL
LEXICON N_HUOLETOI huoletoi: huolet Gradation Yes Harmony: Back
LEXICON N_SAMMAL sammal:sammal Gradation No Harmony: Back
LEXICON N_ŠOUFER šoufer:šoufer Vowel Harmony: Back
LEXICON N_VANUIN vanuin:vanui Gradation Yes Harmony: Back Stem consonant n/m Singular stem vowel 0/0/e Plural stem vowel i
LEXICON N_SAMMUTIN sammutin:sammutti Gradation Yes Harmony: Back Stem consonant n/m Singular stem vowel 0/0/e Plural stem vowel i
LEXICON N_KEITIN keitin:keitti Gradation Yes Harmony: Front Stem consonant n/m Singular stem vowel 0/0/e Plural stem vowel i
LEXICON N_LÄMMIN lämmin:lämbi Gradation Yes Harmony: Front
LEXICON N_TAIGIN taigin:taigin Gradation No Harmony: Back
LEXICON N_KARJALAINE karjalaine:karjala Gradation NA Harmony: Back
LEXICON N_LIYGILÄINE liygiläine:liygilä Gradation NA Harmony: Front
LEXICON N_NAINE naine:nai Gradation NA Harmony: Back
LEXICON N_KIELINE kieline:kieli Gradation NA Harmony: Front
LEXICON N_TOINE toine:to Gradation NA Harmony: Back
LEXICON N_RAIŠ raiš:ra Gradation Yes Harmony: Back
LEXICON N_TOVESTEH tovesteh:tovesteh Gradation No Harmony: Back
LEXICON N_PEREH pereh:pereh Gradation No Harmony: Front
LEXICON N_HUIKEH huikeh:huikkeh Gradation Yes Harmony: Front
LEXICON N_LIIKEH liikeh:liikkeh Gradation Yes Harmony: Front
LEXICON N_PENGER penger:penger Gradation No Harmony: Front
LEXICON N_ARTELI arteli:arteli Gradation No Harmony: Back
LEXICON N_PUHTAHUS puhtahus:puhtahu Harmony: Back Gradation NA
LEXICON N_VÄHYS vähys:vähy Harmony: Back Gradation NA
LEXICON N_ALUS alus:aluks Gradation No Harmony: Back
LEXICON N_ALUS-SG alus:aluks
LEXICON N_ALUS-PL alus:aluks
LEXICON N_ILVES ilves:ilveks Gradation No Harmony: Front
LEXICON N_ILVES-SG ilves:ilveks
LEXICON N_ILVES-PL ilves:ilveks
LEXICON N_MIES mies:mie Gradation No Harmony: Front
LEXICON N_MIES-SG mies:mie
LEXICON N_MIES-PL mies:mie
LEXICON N_KUURNIS kuurnis:kuurne Gradation NA Harmony: Back
LEXICON N_SUALIŠ suališ:suali Gradation NA Harmony: Back
LEXICON N_SUALIŠ-SG suališ:suali
LEXICON N_SUALIŠ-PL suališ:suali
LEXICON N_OLUT olut:olu Gradation No Harmony: Back
LEXICON N_KEVÄT kevät:kevä Gradation No Harmony: Front
LEXICON N_TUHAT tuhat:tuha Gradation No Harmony: Back
LEXICON N_ARMAS armas:arma Gradation NA Harmony: Back
LEXICON N_VARVAS varvas:varba Gradation NA Harmony: Back
LEXICON N_PAREMBI parembi:paremb Gradation NA Harmony: Back
LEXICON N_PESSYH pessyh:pessy Gradation NA Harmony: Front
LEXICON N_BEMMEL bemmel:bembel Gradation Yes Harmony: Front
LEXICON N_SUURIM suurim:suurim Gradation NA Harmony: Back
LEXICON N_TUATINDAM tuatindam:tuatindam Gradation NA Harmony: Back
LEXICON N_TUATANDIM tuatandim:tuatandim Gradation NA Harmony: Back
LEXICON N_SIEMEN siemen:siemen Gradation NA Harmony: Front
LEXICON N_SALBOIN salboin:salboi Gradation No Harmony: Back
LEXICON N_UDAR udar:udar Gradation No Harmony: Back
LEXICON N_PIENAR pienar:piendar Gradation Yes Harmony: Back
Back vowel gradation Yes
LEXICON NMN_MUARJU/PIÄSTÄNDY muarju:muarja gradation NA
LEXICON NMN_AKKU akku:akka gradation Yes
LEXICON NMN_KNIIGU kniigu:kniiga gradation No
Front Vowel Gradation Yes
LEXICON NMN_YKSI yksi:y
LEXICON NMN_VUOZI/VEZI vuozi:vuod
LEXICON NMN_PAREMBI/ENÄMBI parembi:paremb
tuores+A+Comp+Sg+Nom
tuores+A+Comp+Comp+Sg+Ess
(is not standard language)LEXICON NMN_KUDAI kudai:kuda
LEXICON NMN_PIIRAI/VERÄI piirai:piira
LEXICON NMN_PAGIZII-SG
LEXICON NMN_PAGIZII-PL
LEXICON NMN_HUOLETOI/JIÄTÖI huoletoi: huolet
Gradation Yes Vowel Harmony Back
LEXICON NMN_KOIRU/PÄIVY koiru:koira Gradation NA Vowel Harmony Back
LEXICON NMN_AVUAJU/KERIÄJY päivy:päivä Gradation NA
gradation Yes
LEXICON NMN_KOIVU/HERY koivu:koivu Gradation NA Vowel Harmony Back
LEXICON NMN_LUGU/IDY lugu:lugu Gradation Yes Vowel Harmony Back
LEXICON NMN_RAHMANNOI rahmannoi:rahmannoi
LEXICON NMN_HARDIE hardie:hardie
LEXICON NMN_KONDII/STIPENDII kondii:kondi
LEXICON NMN_OSTAI ostua:osta%>j
Stem Vowel 00 kandai, kandajan, kandajua, kandajinnu
LEXICON NMN_PEZII pestä:pezi%>j
LEXICON NMN_KESTÄY kestäy:kestä
LEXICON NMN_RUNO/HÖRÖ runo:runo Gradation No
Gradation No
LEXICON NMN_RUADO/KYNDÖ ruado:ruado Gradation Yes
LEXICON NMN_KUU/PII kuu:kuu
LEXICON NMN_PIÄ piä:piä
LEXICON NMN_VYÖ vyö:vyö
LEXICON NMN_MUA mua:mua
LEXICON NMN_KALA kala:kala Gradation NA
LEXICON NMN_PADA pada:pada Yaml: pada Gradation Yes
LEXICON NMN_TULLUH/PESSYH pessyh
LEXICON NMN_TULLUH tulluh:tullu
LEXICON NMN_KARJAL karjal:karjal
LEXICON NMN_KARJAL-SG karjal:karjal LEXICON NMN_KARJAL-PL karjal:karjal
LEXICON NMN_MADAL madal:madal
LEXICON NMN_MADAL-SG madal:madal LEXICON NMN_MADAL-PL madal:madal
LEXICON NMN_ENIN enin:eni
LEXICON NMN_PUHTAHUS/VÄHYS puhtahus:puhtahu
LEXICON NMN_ALUS/ILVES alus:aluks
LEXICON NMN_ARMAS/EVAES armas:arma
LEXICON NMN_VARVAS varvas:varba
LEXICON NMN_VIDEL videl:videl Gradation No
LEXICON NMN_TUORES tuores:tuore
LEXICON NMN_RAIŠ raiš:ra
LEXICON NMN_KUURNIS kuurnis:kuurne
Gradation None
LEXICON NMN_TOVESTEH/PEREH pereh:pereh
LEXICON NMN_ARTELI/LEIRI arteli:arteli
LEXICON NMN_PAPPI/HÄKKI pappi:pappi
LEXICON NMN_REBOI/JÄNÖI reboi:reboi
LEXICON NMN_OZA/SÄYNÄ oza:oza
LEXICON NMN_TVER tver:tver Gradation NA Front Vowel Singular stem vowel 0/i Plural stem vowel il%{oö%}i
Gradation NA Back Vowel Singular stem vowel 0/i Plural stem vowel il%{oö%}i
LEXICON NMN_VAŽEN važen:važe
LEXICON NMN_LÄMMIN lämmin:lämbi
LEXICON NMN_TAIGIN taigin:taigin
luadi%^WGStem%^E1%>tt%{uy%}%>%{uy%} examples:*
luaji000ttu%>u examples:*
LEXICON SGNOM/PXSP3 adding -h
This (part of) documentation was generated from src/fst/morphology/affixes/nouns.lexc
Numeral inflection is like nominal, except that numerals compound in all forms which requires great amount of care in the inflection patterns.
LEXICON ARABICCOMPOUNDS ! 1-osainen
LEXICON ARABICCASES adds +Arab
LEXICON ARABICCASE adds +Arab
LEXICON ARABICCASE0 adds +Arab
This (part of) documentation was generated from src/fst/morphology/affixes/numerals.lexc
Pronoun inflection Livvi pronouns inflect for case.
LEXICON PRON_TÄMÄ tämä:tä
LEXICON PRON-DEM_NÄMMÄ nämmä:nämmä
LEXICON PRON-DEM_NET net:n
LEXICON PRON-INDEF_ Still requires work
LEXICON PRON-INDEF_KEN kentahto:ke
LEXICON PRON-INDEF_KUDAI kudaitahto:kuda
LEXICON PRON-INDEF_MI mitahto:mi
LEXICON PRON-INDEF_MITTUINE mittuinetahto:mittu
LEXICON PRON-INDEF_NIMI nimi:ni«mi
LEXICON PRON-INDEF_NIKEN niken:ni«ke
LEXICON PRON-INDEF_NIMITTUINE nimittuine:ni«mittu
LEXICON PRON-INTERR_ Still requires work
LEXICON PRON-INTERR_MI mi:mi
LEXICON PRON-INTERR_MITTUINE mittuine:mittu
LEXICON PRON-INTERR_KEN ken:ke
LEXICON PRON-REFL_ iče:ičče iččiedäh, iččedäh, iččeh, ičes, iččenäh, iččiedäs, iččeni,
LEXICON PRON-REL_KUDAI kudai:kuda
LEXICON PRON_ARMAS armas:arma
LEXICON PRON_OZA oza:oza
LEXICON PRON-QNT_KIELI kieli:kiel
LEXICON PRON_TOINE toine:to
LEXICON PRON_KAI requires developing
LEXICON PRON_ENÄMBI enämbi:enämb
This (part of) documentation was generated from src/fst/morphology/affixes/pronouns.lexc
Proper noun inflection The LIVVI-KARELIAN language proper nouns inflect in the same cases as regular nouns, but sometimes with a colon (‘:’) as separator.
LEXICON PROP_
LEXICON PROP_VYÖ vyö:vyö
LEXICON PROP_MUA mua:maa
LEXICON PROP_OZA Gradation NA Back vowel Stem vowel a Plural stem in i
LEXICON PROP_OZA_FEM
LEXICON PROP-PLC_OZA
LEXICON PROP_KALA Gradation NA Back vowel Stem vowel a Plural stem in oi
LEXICON PROP_KALA_PATRFEM
LEXICON PROP-PLC_KALA
LEXICON PROP_KALA_SURFEM
LEXICON PROP_PAPPI Gradation Yes Back vowel Stem vowel i Plural stem in iloi
LEXICON PROP_PAPPI-SG
LEXICON PROP_PAPPI-PL
LEXICON PROP-PLC_PAPPI
LEXICON PROP_ARTELI Gradation NA Back vowel Stem vowel i Plural stem in iloi
LEXICON PROP_ARTELI-SG
LEXICON PROP_ARTELI-PL
LEXICON PROP_LEIRI Gradation NA Front vowel Stem vowel i Plural stem in iloi
LEXICON PROP_LEIRI-SG
LEXICON PROP_LEIRI-PL
LEXICON PROP_NIMI nimi:nim Gradation NA Front vowel Stem vowel i/e Plural stem in i
LEXICON PROP_JÄLGI Gradation Yes Vowel Harmony Front Stem Vowel i/0/e Plural stem in i
LEXICON PROP_JÄLGI-SG
LEXICON PROP_JÄLGI-PL
LEXICON PROP_JOGI Gradation Yes Vowel Harmony Back Stem Vowel i/0/e Plural stem in i
LEXICON PROP_JOGI-SG
LEXICON PROP_JOGI-PL
LEXICON PROP_SUARI suari:suar Gradation NA Vowel Harmony Back Stem Vowel i/0/e Plural stem in i
LEXICON PROP_REBOI reboi:reboi Gradation NA Vowel Harmony Back Stem Vowel oi/o Plural stem in oloi
LEXICON PROP_KOIRU koiru:koira Gradation NA Back vowel Stem vowel u/a Plural stem in i
LEXICON PROP-PLC_KOIRU koiru:koira
LEXICON PROP_PÄIVY päivy:päivä Gradation NA Vowel Harmony Front Stem Vowel y/ä Plural stem in i
LEXICON PROP-PLC_KNIIGU kniigu:kniiga Gradation No (looks like it should have gradation) Vowel Harmony Back Stem Vowel u/a Plural stem in oi
LEXICON PROP_MUARJU muarju:muarja Gradation NA Vowel Harmony Back Stem Vowel u:a Plural stem in o
LEXICON PROP-PLC_MUARJU muarju:muarja
LEXICON PROP_AKKU akku:akka Gradation Yes Vowel Harmony Back Stem Vowel u:a Plural stem in o
LEXICON PROP_KOIVU koivu:koivu Back vowel Gradation NA Stem vowel u Plural stem in loi Can be merged with _RUNO
LEXICON PROP_RUNO runo:runo Back vowel Gradation NA Stem vowel o Plural stem in loi
LEXICON PROP_RUADO ruado:ruado Back vowel Gradation Yes Stem vowel o Plural stem in loi
LEXICON PROP-PLC_RUADO ruado:ruado
LEXICON PROP_KYNDÖ kyndö:kyndö Front vowel Gradation Yes Stem vowel o Plural stem in loi
LEXICON PROP_VALGEI Back vowel Gradation NA
LEXICON PROP_VALGEI-SG
LEXICON PROP_VALGEI-PL
LEXICON PROP_KARJAL karjal:karjal Back vowel Gradation NA Singular stem vowel 0/a Plural stem vowel o
LEXICON PROP-MAL_KARJAL
LEXICON PROP_KARJALAINE karjalaine:karjala
LEXICON PROP_KIELINE kieline:kieli
LEXICON PROP-PLC_TVER Tver:Tver
LEXICON PROP-PLC_TAIGIN
LEXICON PROP_PEREH pereh:pereh
LEXICON PROP_VIDEL videl:videl
LEXICON PROP-PLC_ALUS Alus:Aluks
LEXICON PROP_ALUS Alus:Aluks
LEXICON PROP_KONDII kondii:kondi
LEXICON PROP_STIPENDII kondii:kondi
This (part of) documentation was generated from src/fst/morphology/affixes/propernouns.lexc
Quantifier inflection Livvi quantifiers inflect for case.
LEXICON NUM_MUARJU muarju:muarja
LEXICON NUM_MILJOUNU muarju:muarja
LEXICON NUM_YKSI yksi:y
LEXICON NUM_KAKSI kaksi:ka
LEXICON NUM_SEIČČIE seiččie:seičče
LEXICON NUM_NELLI nelli:nell
LEXICON NUM_KAHEKSA kaheksa:kaheksa
LEXICON NUM_YHEKSÄ yheksä:yheksä
LEXICON NUM_TUHAT tuhat:tuha LEXICON NUM_TUHAT_01 tuhat:tuha
LEXICON NUM_KUUZI kuuzi:kuud
LEXICON NUM_VIIZI viizi:viid
LEXICON NUM_KYMMENE kymmene:kymmen LEXICON NUM_KYMMENE_01 kymmene:kymmen
LEXICON NUM_KOLME
LEXICON ORD_TOINE toine:to
LEXICON ORD_KARJALAINE enzimäine:ensimä
LEXICON ORD_LIYGILÄINE enzimäine:ensimä
This (part of) documentation was generated from src/fst/morphology/affixes/quantifiers.lexc
Proper noun inflection
Erzya proper nouns inflect in the same cases as regular nouns.
Vili:Vil
Russian type Surnames Abdʼejev:Abdʼejev
Bagrij:Bagr
Amorskij:Amorsk
DECLENSION LIMITATIONS
This (part of) documentation was generated from src/fst/morphology/affixes/rus-Cyrl-2-Lat-propernouns.lexc
This (part of) documentation was generated from src/fst/morphology/affixes/symbols.lexc
LEXICON V-NEG ei:e
LEXICON V-PROH älä:äl
V1a
LEXICON V_ALENDUA alendua:alenda Harmony: Back
LEXICON V_ALENDUA/ELLENDIÄ_01 ellendiä:ellendä alendua:alenda Gradation No
LEXICON V_LIIKUTTUA liikuttua:liikutta Harmony: Back
LEXICON V_LIIKUTTUA/MENETTIÄ_01 liikuttua:liikutta, menettiä:menettä Gradation YES
LEXICON V_KANDUA_01 kandua:kanda Gradation Yes
LEXICON V_OTTUA/HEITTIÄ_01 ottua:otta Gradation Yes
LEXICON V_OTTUA-SG3 ottua:otta
LEXICON V_OTTUA-3 ottua:otta
LEXICON V_HEITTIÄ heittä:heittä Front vowel
LEXICON V_HEITTIÄ-SG3 heittä:heittä
V1o
LEXICON V_PUNUO/XX_01 punuo:puno Gradation split this 2024-08-19 INDPRSSG1, INDPRSSG2, INDPRSPL1, INDPRSPL2, INDPRSCONNEG_WEAK, INDPRTPL3_Ettih, INDPRTCONNEGPL3_EttU, CONDPRSPL3_EttAs, IMPRT_kkA, PSS-PRSPRC_ttAvU
V1u
LEXICON V_VALUO/PYZYÖ_01 valuo:valu Gradation No similar_to: V_PYZYÖ
LEXICON V_VALUO-SG3 valuo:valu
LEXICON V_PYZYÖ pyzyö:pyzy Front vowel
LEXICON V_PYZYÖ-SG3 pyzyö:pyzy
LEXICON V_PYZYÖ-3 pyzyö:pyzy
V1i
LEXICON V_POTKIE/EČČIE_01 potkie:potki Gradation No Ind+Prt+Sg3
LEXICON V_POTKIE-SG3 potkie:potki
LEXICON V_EČČIE eččie:ečči Front vowel
LEXICON V_LUGIE/IMIE_01 lugie:luge Gradation Yes
:%>e INDPRSPL3_tAh ;
LEXICON V_TUNDIE_01 tundie:tunde Gradation Yes Back vowel
LEXICON V_IMIE imie:ime Gradation No Front vowel
LEXICON V_IMIE-SG3_01 imie:ime
LEXICON V_PIDIÄ_01 pidiä:pidä
LEXICON V_VOIJA_01 voija:voi
+Inf:%>ja K ; type 2 also suaha
+Inf:%>ja K ; type 2 also suaha
+Inf:%>ha K ; type 2 also suaha
LEXICON V_JUVVA_01 juvva:jo
+Inf:%>hä K ; type 2 also jiähä
LEXICON V_MYVVÄ_01 myvvä:mö
LEXICON V_VIIJÄ viijä:ve
LEXICON V_NÄHTÄ_01 nähtä:nä
LEXICON V_TULLA tulla:tul
LEXICON V_ULISTA ulista:ulis
LEXICON V_JUOSTA_01 juosta:juoks
LEXICON V_PAISTA_01 paista:pagis
LEXICON V_PESTÄ pestä:pes
Verbs of the Finnish type 3 but not identical
LEXICON V_KUULTA kuulta:kuul
LEXICON V_KUULTA-SG3 kuulta:kuul
LEXICON V_VIERTÄ viertä:vier
Verbs of the Finnish type 3 but not identical gradation: yes
LEXICON V_KUVITELLA kuvitella:kuvittel
LEXICON V_EZITELLAE ezitellä:ezittel
LEXICON V_TYPÄTÄ typätä:typpä
LEXICON V_KÄITÄ käitä:käge
LEXICON V_SUVATA suvata:suva
gradation: no
gradation: no
gradation: no
LEXICON V_BAUHUTA bauhuta:bauhu
LEXICON V_BAUHUTA-SG3 bauhuta:bauhu
LEXICON V_LANGETA langeta:lange
gradation: no
gradation: yes
gradation: yes
LEXICON V_RUVETA ruveta:rube
LEXICON V_RUVETA-SG3 ruveta:rube
LEXICON V_HYRRÄTÄ hyrrätä:hyrrä
gradation: no
gradation: no
gradation: no
gradation: no
gradation: yes
gradation: yes
gradation: yes
LEXICON V_SUVAIJA suvaija:suvai
LEXICON V_SUVAIJA-SG3 suvaija:suvai
LEXICON V_SUVAIJA-3 suvaija:suvai
LEXICON V_EMÄNDÖIJÄ emändöijä:emändöi
LEXICON V_MAINITA mainita:maini
LEXICON V_ALETA_01 aleta:ale (alenov)
LEXICON V_VÄHETÄ_01 vähetä:vähe (vähenöy)
LEXICON V_VÄHETÄ-SG3 vähetä:vähe (vähenöy)
LEXICON V_SUATA_01 suata:suva (suvaiččov)
LEXICON VR_AMBUO_01 ambuokseh:ambu
LEXICON VR_VALUO/PYZYÖ valuokseh:valu
LEXICON VR_PYZYÖ valuokseh:valu
LEXICON VR_PUNUO/XX_01 punuokseh:puno
LEXICON VR_IMIE imiekseh:ime
LEXICON VR_POTKIE/EČČIE_01 imiekseh:ime
LEXICON VR_POTKIE potkiekseh:potki
LEXICON VR_PIDIÄ_01 pidiäkseh:pidä
LEXICON VR_KANDUA_01 kanduakseh:kanda
LEXICON VR_KANDUA-SG3_01 kanduakseh:kanda
LEXICON VR_OTTUA/HEITTIÄ_01 ottuakseh:otta
LEXICON VR_LIIKUTTUA/XX_01 kirjuttuakseh:kirjutta
Reflexive verbs
LEXICON VR_RODIEKSEH rodiekseh:rodi
LEXICON VR_ROIJA roijakseh:roi
LEXICON VR_ROITA roita:roi
LEXICON VR_MYVVÄ myvvä:mö
LEXICON VR_NÄHTÄ nähtä:nä
LEXICON VR_TULLA tuliekseh:tul !this will need work 2013-10-23
LEXICON VR_PESTÄ tuliekseh:tul !this will need work 2013-10-23
LEXICON VR_KUVITELLA kuvitellakseh:kuvittel !this will need work 2016-08-17
LEXICON VR_EBAEILLAE kuvitellakseh:kuvittel !this will need work 2016-08-17
LEXICON VR_AVATA avatakseh:ava (typätäkseh:typpä)
LEXICON VR_SUVAIJA suvaijakseh:suvai
LEXICON VR_EMÄNDÖIJÄ emändöijäkseh:emändöi
V1 This verb type has two final vowels in the first infinitive
LEXICON V-3SYLL_KIRJUTTUA kirjuttua:kirjutta
LEXICON V-VV_03_Harm-Neutr tulla:tul
LEXICON V-VV_KUULLA kuulla:kuul
LEXICON V-VV_VIERRÄ vierrä:vier
LEXICON V-VV_KUVITELLA/EZITELLAE kuvitella:kuvittel
LEXICON V-VV_KUVITELLA/EZITELLAE suvaija:suvai
LEXICON V-VV_KUVITELLA/EZITELLAE emändöijä:emändöi
LEXICON V-VV_KUVITELLA/EZITELLAE mainita:maini
LEXICON V-VV_KUVITELLA/EZITELLAE merkitä:merki
HOW WILL THESE WORK
HOW WILL THESE WORK
HOW WILL THESE WORK
Forthcoming
… and next chapter
This (part of) documentation was generated from src/fst/morphology/affixes/verbs.lexc
This file documents the phonology.twolc file
a b c č d e f g h i j k l m n o p r s š ş t u v w y z ž ƶ ü ä ö A B C Č D E F G H I J K L M N O P R S Š Ş T U V W Y Z Ž Ƶ Ü Ä Ö y Y
%{aä%}:a %{aä%}:ä
%{aoeInf%}:a Back vowel infinitive marker %{äöeInf%}:ä Front vowel infinitive marker
%{ui%}:i in imperative before %{aä%}:
%{oö%}:o %{oö%}:ö
%{oöØ%}:0 Used in present participle
%{uy%}:y
%{ijPRC%}:j participle
this appears in the illative V1:a V1:e V1:i V1:o V1:u V1:y V1:ä V1:ö
These appear with the inessive and adessive V2:a V2:e V2:i V2:o V2:u V2:y V2:ä V2:ö
These reduplicate the preceding vowel if it in turn is preceded by a consonant V3:a V3:e V3:i V3:o V3:u V3:y V3:ä V3:ö V3:0
%^DEVOICE:0
%^VOICE:0 pestä : pezen
%^SyllBound:0
%^KS2S:0
no change The example here is for something that should not be done We have two infinite sets, Olonets-Karelian and incoming loanwords. The original idea was to make a rule changing all instances of adjacent double aa to ua. For this reason a special trigger was to be inserted into the lexc stem of a word to prevent such a rule from occurring. Since the infinite Olonets-Karelian set is more predictable and perhaps smaller than the incoming loanword set, it is better to literally spell out adjacent vowels that are constant. 2019-09-02 JMR %^NONE:0 This will break vowel change, e.g. sa%^NONEamelaine
%^Pen:0 pagisou paistah in combination with WGStem to trigger
%^D2Z:0 ! The ti => zi
%^E2O:0 the e => o, e => ö
%^Ä2I:0 ä => i %^A2UÄ2I:0 a => u, ä => i
%^ILoss:0 the i => 0 reboi reboloi %^V2U:0 v => y kävvä käydy
_%^RVow:0 _ _%^RmVow:0 _ vowel removal, e.g. with superlative
Triggers dictating right context phenomena
Right context for gradation
Rule: %{aä%}:a kuvitella+V+Inf: imagine/kuvitella
Rule: %{aä%}:ä ezitellä+V+Inf: present/esitellä
Rule: %{aäoeö%}:a
Rule: %{aäoeö%}:ä heittiä+V+Inf: throw/heittää
Rule: %{aäoeö%}:o
Rule: %{aäoeö%}:e
Rule: %{aäoeö%}:ö
Rule: {aäoeöuiy%}:a
Rule: {aäoeöuiy%}:ä
Rule: {aäoeöuiy%}:o
Rule: {aäoeöuiy%}:e
Rule: {aäoeöuiy%}:ö
Rule: {aäoeöuiy%}:u
Rule: {aäoeöuiy%}:i
Rule: {aäoeöuiy%}:y
Rule: e:0
Rule: %{ui%}:u
* %{front%}:0 ! imperative forms
* *kanda%{back%}%>kk%{ui%}%{aä%}mm%{oö%}*
* *kanda0%>kkuammo*
Rule: %{ui%}:i
* %{front%}:0 ! imperative forms
* *lʼykkä%{front%}%^WGStem%>kk%{ui%}%{aä%}mm%{oö%}*
* *lʼyk0ä00%>kkiämmö*
Rule: %{ui%}:i Rule: %{ui%}:i Rule: %{ui%}:i
Rule: %{aäuyiØ%}:a
Rule: %{aäuyiØ%}:u
Rule: %{aäuyiØ%}:ä
Rule: %{aäuyiØ%}:y
Rule: %{aäuyiØ%}:i
Rule: %{aäuyiØ%}:0
a:u before subsequent a Diphthong a+a => ua ostua+V+Inf: buy/ostaa
osta%>%{aä%} examples:*
ostu%>%{aä%} examples:*
ä:i before subsequent A2 Diphthong ä+ä => iä kehittiä+V+Inf: develop/kehittää
kehittä%>%{aä%} examples:*
kehitti%>ä examples:*
o:u before subsequent o Diphthong o+o => uo erota:eruou
ero%>%{oö%}%>n examples:*
eru0o%>n examples:*
runo%>%{oö%} examples:*
runu%>o examples:*
jo%>vv%{aä%} examples:*
ju%>vva examples:*
ö:y before subsequent ö Diphthong ö+ö, ö+%{oö%}:ö => yö
sö%>ö%>y examples:*
sy%>ö%>y examples:*
pöllö%>ö examples:*
pölly%>ö examples:*
sö%>vvä examples:*
sy%>vvä examples:*
e:i before subsequent :e Diphthong o+o => uo erota:eruou
* valge%>e%^WGStem%>t examples:*
* valgi%>e00t examples:*
* hävi%>%{aäPar%}%>n examples:*
* hävi%>e0n examples:*
* tiedo%^WGStem%>n examples:*
* tiijo0%>n examples:*
* *väge%{front%}%^WGStem>n*
* *vä0i00%>n*
* luge>%{ijPRC%} examples:*
* lugi%>j examples:*
e:o Vowel harmony suffixes Back
* luge%^E2O%>u examples:*
* lugo%>u examples:*
e:ö Vowel harmony
käske%^E2O%>y examples:*
käskö0%>y examples:*
Rule: ä:y word final
Rule: ä:ö word final
Rule: a:0 in suaja:sai
* a _ (%{back%}:) %^RmVow: ; Vowel shortening before subsequent "i"
* koira%^RmVow%>i%>en examples:*
* koir00%>i%>en examples:*
* koira0%>j%>en examples:*
* vahna%^RmVow%>in examples:*
* vahn00%>in examples:*
* *otta%{back%}%^WGStem%^RmVow%>in*
* *ot00000%>in*
Rule: i:j
%{ijPRC%}:i
* a _ (%{back%}:) %^RmVow: ; +ActPrsPrc
Rule: a:o in the plural and preterite
a:e pidiä
kanda%^WGStem%^E1%>tt%{aäDiph%}s examples:*
kanne00%>ttas examples:*
kirjava%^E1%>m%^NONEb examples:*
kirjave0%>m0b examples:*
ä:e piettih and in comparatives pidiä
Rule: ä:0 pidiä:pidi
Rule: i:0 reboi:reboloi
%{oö%}:o Vowel harmony suffixes Back %{oö%}:ö Vowel harmony suffixes Front
%{oöØ%}:0 Vowel harmony suffixes Back
%{oöØ%}:o Vowel harmony suffixes Back
luge%>%{ijPRC%}%>%{oöØ%}%>i%>n examples:*
lugi%>j%>o%>i%>s examples:*
%{oöØ%}:ö Vowel harmony suffixes Front
%{uy%}:u Vowel harmony suffixes Back
%{uy%}:y Vowel harmony suffixes Front
Rule: g:j
* *poiga%^WGStem%>n*
* *poija0%>n*
Rule: g:v
jov0e0%>n
g:l
g:r
* särge%^WGStem%>n examples:*
* särre0%>n examples:*
* kergi%^WGStem%>t%{aäPar%} examples:*
* kerri0%>tä examples:*
d:v
d:v <=> [ ö y: | o u: ] _ [ ä: | a ] (HarmDummy:) %^WGStem:0 ;
u a _ o (%{back%}:) %^WGStem:0 ;
a _ u (%{back%}:) %^WGStem:0 ;
[ u o: | u: ] _ [ (%{back%}:) e | a: ] (%{back%}:) ((%^RmVow:) %> i ) %^WGStem:0 ;
[ ä y: ] _ [(%{front%}:) e | y ] (%{front%}:) ((%^RmVow:) %> i ) %^WGStem:0 ;
* *täydy%{front%}%^WGStem*
* *tävvy00*
Rule: v:y d:j
Rule: y:v
täydy%^WGStem examples:*
tävvy0 examples:* lʼöydiä:lʼövvä
Rule: u:v juodu+N+Pl+Ade
o:v Lengthening with Ut:vv weakening
d:z
d:t in partitive
pagis%>d%{aä%}%^WGStemh examples:*
pa0is%>ta0h examples:*
Rule: s:z
* s:z <=> _ (HarmDummy:) %^VOICE:0 ; pestä: pezen
k:g pestä: peskäh
rd:rr weakening
ld:ll weakening
nd:nn weakening mennä+Ind+Prs+ScPl3: mennäh
kanda%^WGStem%^E1%>tt%{aä%}s examples:*
kanne00%>ttas examples:*
rn:rr in prtprc
ln:ll in prtprc
sn:ss in prtprc
Rule: %{dtlnr%}:d
Rule: %{dtlnr%}:t
Rule: %{dtlnr%}:l tulla+Ind+Prs+ScPl3: tullah
Rule: %{dtlnr%}:n
Rule: %{dtlnr%}:r
čToZero
* s:z <=> _ (HarmDummy:) %^VOICE:0 ; suvaija:suvaičen
kToZero aika: ajan
* *lʼykkä%{front%}%^WGStem%>t%{AÄ%}*
* *lʼy0kä00%>tä*
* *abuniekka%{back%}%^A2O%>i%>l*
* *abunie0ko00%>i%>l*
* *liikkeh%{front%}%^WGStem*
* *lii0keh00*
gToZero aika: ajan
ezitellä+V+Inf: present/esitellä
ezi0tel00%>lä0
tavoitteh%^WGStem examples:*
tavoi0teh0 examples:*
b:v
Rule: b:m
b:m <=> m _ [ a | i ] (HarmDummy:) %^WGStem:0 ;
p:0 in lapsi
d:0 pidiä:piän
pidä%^WGStem%>n examples:*
pi0ä0%>n examples:*
s:0
bussa%^WGStem%>l examples:*
bu0sa0%>l examples:*
This (part of) documentation was generated from src/fst/morphology/phonology.twolc
The morphological analyses of wordforms of Livvi are presented in this system in terms of following symbols. (It is highly suggested to follow existing standards when adding new tags).
The parts-of-speech are:
+Coll collective, probably from a common file.
The prefix (before “/”) is Cmp.
The Usage extents are marked using following tags:
The nominals are inflected in the following Case and Number
The possession is marked as such:
The comparative forms are:
Numerals and Quantifiers are classified under:
+Iter Iterative form expressing number of times; myv: кавксть
, kpv: кыкысь
Verb tenses are: | +Prs | Present, non-past Tense
Verb moods are:
Verb personal forms are: +Rfl : This is a work around for olo passive. Olo has a passive conjugation, whereas Finnish and Estonian do not. Other verb forms are +Inf : Infinitive +Act : active voice +Pss : passive voice +PrfPrc : past participle +PrsPrc : present participle +RcPrfPrc : reflexive past participle +Ger : Gerund
+ConNeg +ConNegII +Neg +ImprtII +PrfPrcPl3 +Sup +VGen +VAbess
+ABBR +ACR
Non-dictionary words can be recognised with: +Guess
Question and Focus particles:
+Clt/bo
Pmatch 2021-03-13
semantic types of adverbs
Semantics are classified with
Derivations are classified under the morphophonetic form of the suffix, the source and target part-of-speech.
To represent phonologic variations in word forms we use the following symbols in the lexicon files:
%{aoeInf%} Back vowel infinitive marker %{äöeInf%} Front vowel infinitive marker
And following triggers to control variation
%^A2UÄ2Y with single vowels %^E2O the e => o, e => ö +Ind+Prs+ScSg3 %^Ä2I ä => i %^A2UÄ2I a => u, ä => i in consecutive vowels %^V2U v => y kävvä käydy
**%[%>%]** - Literal >
**%[%<%]** - Literal <
These are for developing underlying morphology rules
(escaped with square brackets, to avoid collision with > as morpheme boundary)
We have manually optimised the structure of our lexicon using following flag diacritics to restrict morhpological combinatorics - only allow compounds with verbs if the verb is further derived into a noun again: | @P.NeedNoun.ON@ | (Dis)allow compounds with verbs unless nominalised | @D.NeedNoun.ON@ | (Dis)allow compounds with verbs unless nominalised | @C.NeedNoun@ | (Dis)allow compounds with verbs unless nominalised
For languages that allow compounding, the following flag diacritics are needed to control position-based compounding restrictions for nominals. Their use is handled automatically if combined with +CmpN/xxx tags. If not used, they will do no harm. | @P.CmpFrst.FALSE@ | Require that words tagged as such only appear first | @D.CmpPref.TRUE@ | Block such words from entering ENDLEX | @P.CmpPref.FALSE@ | Block these words from making further compounds | @D.CmpLast.TRUE@ | Block such words from entering R | @D.CmpNone.TRUE@ | Combines with the next tag to prohibit compounding | @U.CmpNone.FALSE@ | Combines with the prev tag to prohibit compounding | @P.CmpOnly.TRUE@ | Sets a flag to indicate that the word has passed R | @D.CmpOnly.FALSE@ | Disallow words coming directly from root.
Use the following flag diacritics to control downcasing of derived proper nouns (e.g. Finnish Pariisi -> pariisilainen). See e.g. North Sámi for how to use these flags. There exists a ready-made regex that will do the actual down-casing given the proper use of these flags. | @U.Cap.Obl@ | Allowing downcasing of derived names: deatnulasj. | @U.Cap.Opt@ | Allowing downcasing of derived names: deatnulasj.
Flags used to identify parts of speech
cardinal vs ordinal
Orthographical errors? 2021-03-13
Pmatch 2021-03-13
Removal
| Flag diacritic | Explanation | :————- |:———– | @U.number.one@ | Flag used to give arabic numerals in smj different cases ; | @U.number.two@ | Flag used to give arabic numerals in smj different cases ; | @U.number.three@ | Flag used to give arabic numerals in smj different cases ; | @U.number.four@ | Flag used to give arabic numerals in smj different cases ; | @U.number.five@ | Flag used to give arabic numerals in smj different cases ; | @U.number.six@ | Flag used to give arabic numerals in smj different cases ; | @U.number.seven@ | Flag used to give arabic numerals in smj different cases ; | @U.number.eight@ | Flag used to give arabic numerals in smj different cases ; | @U.number.nine@ | Flag used to give arabic numerals in smj different cases ; | @U.number.zero@ | Flag used to give arabic numerals in smj different cases ;
NEWWORDS FILES A_NEWWORDS ; adjectives ADV_NEWWORDS ; adverbs N_NEWWORDS ; nouns PROP_NEWWORDS ; proper nouns V_NEWWORDS ; verbs
I INCLUDE SOME SMALL LEXICA HERE WAITING FOR OWN FILES, OR PERHAPS THEY COULD STAY HERE
This (part of) documentation was generated from src/fst/morphology/root.lexc
This is where new words are added as lexc entries before they are added to the xml source files. lyhyt+A:lyhy A_ “/(eng) short/(fin) lyhyt” ;
ADD ADJECTIVES BELOW!
This (part of) documentation was generated from src/fst/morphology/stems/adjectives_newwords.lexc
This is where new words are added as lexc entries before they are added to the xml source files. rounu+Adv:rounu ADV_ “/(eng) /(fin) tasan” ;
ADD ADVERBS BELOW!
This (part of) documentation was generated from src/fst/morphology/stems/adverbs_newwords.lexc
Special verbal forms
ADJECTIVES
Adverbs incomplete #päivy+N+Sg+Ess+Adv+Adv:#piän This should help in compound words Essive
Conjunctors incomplete
MISC
POSTPOSITIONS
Pronouns
Proper given names female
Proper given names male
Place names
Nouns incomplete
identitiettu iänenando iänenandolippu julgavot Jougamoine kirjutannuh koudu kulʼtuurutego livgiläine nielenvaldu muasku noumer partnʼourat piäkirjutus politiekku pyhänpiän sananparzi sportu immigrantat valličendupäivänny valličenduvirguniekku valliiskoikieline Wikipedii
Väinämöine Verbs INCOMPLETE
PROPER NOUNS FROM OLONETS
Undentified Morph
This (part of) documentation was generated from src/fst/morphology/stems/exceptions.lexc
This is where new words are added as lexc entries before they are added to the xml source files. pappi+N:pappi N_PAPPI “/(eng) priest/(fin) pappi” ;
ERRONEOUS FORMS sluudielaine+N:sluudiela : stuudielaine oldihanukselaine+N:oldihanuksela : oldih_anukselazet
ADD NOUNS BELOW!
This (part of) documentation was generated from src/fst/morphology/stems/nouns_newwords.lexc
Numerals Numerals in the Livvi language are numbers.
Numerals have been split in three sections, the compounding parts of cardinals and ordinals, and the non-compounding ones:
kaksi+Num+Sg+Nom#kymmenen+Num+Sg+Par#kolme+Num+Sg+Nom#tuhat+Num+Sg+Par
(Eng. ! 23,000)kaksi+Num+Sg+Nom#kymmenen+Num+Sg+Par#kolme+Num+Sg+Nom#tuhat+Num+Sg+Par
kahđes+A+Ord+Sg+Nom#sađas+A+Ord+Sg+Nom#neljes+A+Ord+Sg+Nom
(Eng. ! 204rd)viitisen+Num#kymmentä
(Eng. ! 50-ish)This (part of) documentation was generated from src/fst/morphology/stems/numerals.lexc
Prefixes Prefixes in the Livvi language are bound to beginning of other words.
This (part of) documentation was generated from src/fst/morphology/stems/prefixes.lexc
This is where new words are added as lexc entries before they are added to the xml source files. merki:merki PROP_ “/(eng) mark/(fin) merkki” ;
ADD NOUNS BELOW!
This (part of) documentation was generated from src/fst/morphology/stems/propernouns_newwords.lexc
Male given names that are used for deriving patronymics
Francʼ+N+Prop:Franc
Vili+N+Prop:Vil
FEMALE GIVEN NAMES
This (part of) documentation was generated from src/fst/morphology/stems/rus-Cyrl-2-Lat-propernouns.lexc
This is where new words are added as lexc entries before they are added to the xml source files. kandua+V:kanda V_KANDUA “/(eng) carry/(fin) kantaa” ;
ADD VERBS BELOW! These Below exist in xml but lack Finnish translation
This (part of) documentation was generated from src/fst/morphology/stems/verbs_newwords.lexc
retroflex plosive, voiceless t ʈ 0288, 648 (
= ASCII 096)
retroflex plosive, voiced d ɖ 0256, 598
labiodental nasal F ɱ 0271, 625
retroflex nasal n
ɳ 0273, 627
palatal nasal J ɲ 0272, 626
velar nasal N ŋ 014B, 331
uvular nasal N\ ɴ 0274, 628
bilabial trill B\ ʙ 0299, 665
uvular trill R\ ʀ 0280, 640
alveolar tap 4 ɾ 027E, 638
retroflex flap r ɽ 027D, 637
bilabial fricative, voiceless p\ ɸ 0278, 632
bilabial fricative, voiced B β 03B2, 946
dental fricative, voiceless T θ 03B8, 952
dental fricative, voiced D ð 00F0, 240
postalveolar fricative, voiceless S ʃ 0283, 643
postalveolar fricative, voiced Z ʒ 0292, 658
retroflex fricative, voiceless s
ʂ 0282, 642
retroflex fricative, voiced z` ʐ 0290, 656
palatal fricative, voiceless C ç 00E7, 231
palatal fricative, voiced j\ ʝ 029D, 669
velar fricative, voiced G ɣ 0263, 611
uvular fricative, voiceless X χ 03C7, 967
uvular fricative, voiced R ʁ 0281, 641
pharyngeal fricative, voiceless X\ ħ 0127, 295
pharyngeal fricative, voiced ?\ ʕ 0295, 661
glottal fricative, voiced h\ ɦ 0266, 614
alveolar lateral fricative, vl. K alveolar lateral fricative, vd. K\
labiodental approximant P (or v) alveolar approximant r\ retroflex approximant r` velar approximant M\
retroflex lateral approximant l`
palatal lateral approximant L
velar lateral approximant L
Clicks
bilabial O\ (O = capital letter)
dental |
(post)alveolar !\
palatoalveolar =\
alveolar lateral ||
Ejectives, implosives
ejective > e.g. ejective p p> implosive < e.g. implosive b b< Vowels
close back unrounded M close central unrounded 1 close central rounded } lax i I lax y Y lax u U
close-mid front rounded 2 close-mid central unrounded @\ close-mid central rounded 8 close-mid back unrounded 7
schwa ə @
open-mid front unrounded E open-mid front rounded 9 open-mid central unrounded 3 open-mid central rounded 3\ open-mid back unrounded V open-mid back rounded O
ash (ae digraph) { open schwa (turned a) 6
open front rounded & open back unrounded A open back rounded Q Other symbols
voiceless labial-velar fricative W voiced labial-palatal approx. H voiceless epiglottal fricative H\ voiced epiglottal fricative <\ epiglottal plosive >\
alveolo-palatal fricative, vl. s\ alveolo-palatal fricative, voiced z\ alveolar lateral flap l\ simultaneous S and x x\ tie bar _ Suprasegmentals
primary stress “
secondary stress %
long :
half-long :\
extra-short _X
linking mark -
Tones and word accents
level extra high _T level high _H level mid _M level low _L level extra low _B downstep ! upstep ^ (caret, circumflex)
contour, rising contour, falling _F contour, high rising _H_T contour, low rising _B_L
contour, rising-falling _R_F
(NB Instead of being written as diacritics with _, all prosodic
marks can alternatively be placed in a separate tier, set off
by < >, as recommended for the next two symbols.)
global rise
voiceless 0 (0 = figure), e.g. n_0 voiced _v aspirated _h more rounded _O (O = letter) less rounded _c advanced _+ retracted _- centralized _” syllabic = (or _=) e.g. n= (or n=) non-syllabic _^ rhoticity `
breathy voiced _t creaky voiced _k linguolabial _N labialized _w palatalized ‘ (or _j) e.g. t’ (or t_j) velarized _G pharyngealized _?\
dental d apical _a laminal _m nasalized ~ (or _~) e.g. A~ (or A~) nasal release _n lateral release _l no audible release _}
velarized or pharyngealized _e velarized l, alternatively 5 raised _r lowered _o advanced tongue root _A retracted tongue root _q
This (part of) documentation was generated from src/fst/phonetics/txt2ipa.xfscript
We describe here how abbreviations are in Livvi are read out, e.g. for text-to-speech systems.
For example:
This (part of) documentation was generated from src/fst/transcriptions/transcriptor-abbrevs2text.lexc
O L O N E T S I A N G R A M M A R C H E C K E R
This section lists all the tags inherited from the fst, and used as tags in the syntactic analysis. The next section, Sets, contains sets defined on the basis of the tags listed here, those set names are not visible in the output.
BOS EOS
N A Adv V Pron CS CC CC-CS Po Pr Pcle Num Interj ABBR ACR CLB LEFT RIGHT WEB PPUNCT PUNCT
COMMA ¶
Pers Dem Interr Indef Recipr Refl Rel Coll NomAg Prop Allegro Arab Romertall
Nom Acc Gen Ill Loc Com Ess Ess Sg Du Pl Cmp/SplitR Cmp/SgNom Cmp/SgGen Cmp/SgGen PxSg1 PxSg2 PxSg3 PxDu1 PxDu2 PxDu3 PxPl1 PxPl2 PxPl3 Px
Comp Superl Attr Ord Qst IV TV Prt Prs Ind Pot Cond Imprt ImprtII Sg1 Sg2 Sg3 Du1 Du2 Du3 Pl1 Pl2 Pl3 Inf ConNeg Neg PrfPrc VGen PrsPrc Ger Sup Actio VAbess
Err/Orth
Sem/Act Sem/Ani Sem/Atr Sem/Body Sem/Clth Sem/Domain Sem/Feat-phys Sem/Fem Sem/Group Sem/Lang Sem/Mal Sem/Measr Sem/Money Sem/Obj Sem/Obj-el Sem/Org Sem/Perc-emo Sem/Plc Sem/Sign Sem/State-sick Sem/Sur Sem/Time Sem/Txt
HUMAN
PROP-ATTR PROP-SUR
TIME-N-SET
@+FAUXV @+FMAINV @-FAUXV @-FMAINV @-FSUBJ> @-F<OBJ @-FOBJ> @-FSPRED<OBJ @-F<ADVL @-FADVL> @-F<SPRED @-F<OPRED @-FSPRED> @-FOPRED> @>ADVL @ADVL< @<ADVL @ADVL> @ADVL @HAB> @<HAB @>N @Interj @N< @>A @P< @>P @HNOUN @INTERJ @>Num @Pron< @>Pron @Num< @OBJ @<OBJ @OBJ> @OPRED @<OPRED @OPRED> @PCLE @COMP-CS< @SPRED @<SPRED @SPRED> @SUBJ @<SUBJ @SUBJ> SUBJ SPRED OPRED @PPRED @APP @APP-N< @APP-Pron< @APP>Pron @APP-Num< @APP-ADVL< @VOC @CVP @CNP OBJ