South Sámi NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-sma

Documenting the South Saami lexicon file

The nouns

The noun stems are stored in gt/sma/noun-sma-lex.txt, whereas the morphology is found in gt/sma/sma-lex.txt. It was made by converting the original Moshagen / Trosterud sma lexicon to the Xerox format. The lexical rules are taken from Karttunen’s alternative formulation of our file. His original is found in the archive of original files (Contact Trond for reference, if needed).

The nouns are divided into three stem classes, the N_IE nouns (gåetie, etc.), the N_OE nouns (bearkoe, etc) and the N_OTHER nouns (all other ones, bi- and trisyllabic alike).

The case forms fall in three different groups: Forms unique to each of the three stem classes (listed under each sublexicon), forms with -j- in the N_OE class and -i- in the other ones (with separate i- and j- sublexica and a common suffix lexicon), and forms common to all stem classes (covered in a common continuation lexicon).

Here is a list of the lexica (to be documented)

 N_ODD
 N_ODD_NODISIMP
 ÅABPETJH        !default N_ODD plural lexicon
 N_ODD_C         !these words have consonant-ending in nominative
 AAJEGE          !Sg+Nom=Aajege/Aajeh  Sg+Cmp=Aajeh-
 AAREGE          !Sg+Nom=Aarege/Aareh  Sg+Cmp=Aarege-/Aareh-
 BAARTEGE        !Sg+Nom=baartege/baarth  Sg+Cmp=baartege-/baarth-
 GAAJSEGE        !Sg+Nom=gaajsege/gaajsh  Sg+Cmp=gaajsh-
 LAADTEGE        !Sg+Nom=laadtege  Sg+Cmp=laadth-
 SAADTEGE        !Sg+Nom=saadtege  Sg+Cmp=saadtege-/saadth-
 LEEJJEGE        !Sg+Nom=leejjege  Sg+Cmp=leejjeh-
 DEAKEHKE        !Sg+Nom=deakehke/deakah  Sg+Cmp=deakehke-/deakah-
 ÅERUVE
 BÅERUVE
 VUANOVE
 BÅERUJE
 IJE_ODD
 DAKTERE
 N_IE
 VUELIE
 TJIDTJIE
 TJÅENIEH       !ie plural lexicon
 N_OE_UML
 N_OE
 LAAHKOE
 GAAROEH        !-oe plural
 MAANA
 AAHKA
 NIEJTE
 MAAKE
 JOVKEMES 

The adjectives

The continuation lexica are built on the following convention:

attrsuffix_PREDSUFFIX_STEMTYPECOMPTYPE

A letter C in the beginning of the suffix marks consonant. There may be more than one suffix both for attr and PRED, thereby the difference small/capital letters. Example:

faelskies+CmpN/SgN+CmpN/SgG+CmpN/PlG:faelsk ies_IES_IE_EVEN

This is an even-syllabic adjective, with -ies attributive and -ies or -ie in predicative. Since nothing is said about comparative forms, it has normal comparative and superlative inflection.

The verbs

The auxiliary lea and the negative verbs have been added. These verbs are irregular, and have thus been added without the use of any morphophonological processes.

To be written: Documentation for verb lexica.

The adpositions and prepositions

The adpositions in Bergsland’s grammar have been listed, in two groups, pure postpositions and combined pre/postpositions (named “adpositions”). Other adpositions ahve been added.

Sitemap

Debugging site.pages:

URL: /assets/css/style.css - Title:

URL: /ConvertingToApertium.html - Title:

URL: /KompilereFST.html - Title:

URL: /Links.html - Title:

URL: /adj-meeting-05-2009.html - Title: Stoda no

URL: /docu-sma-adjs.html - Title: Sørsamiske adjektiv, system

URL: /docu-sma-background.html - Title: Background information on the South Saami project

URL: /docu-sma-bugs.html - Title: Bug reports, errors

URL: /docu-sma-deptags.html - Title: South Saami dependency tags

URL: /docu-sma-grammartags.html - Title: Overview

URL: /docu-sma-lex.html - Title: Documenting the South Saami lexicon file

URL: /docu-sma-morphophonology.html - Title: South Saami morphophonological processes

URL: /docu-sma-testplan.html - Title: Test plan for sma

URL: /docu-sma-twol.html - Title: Documentation of South Saami rules

URL: /docu-sma-verbs.html - Title: Souths Saami verb morphology

URL: /gramcheck/collecting-developer-texts.html - Title: Collecting developer texts

URL: /gramcheck/ - Title: Grammar checker for South Saami

URL: /index-header.html - Title: South Sámi documentation

URL: / - Title: South Sámi documentation

URL: /lemma.html - Title: Prinsipp for lemmatisering av sørsamisk

URL: /normativity-issues.html - Title: Background

URL: /sma-korpus-innsamling.html - Title: Korpusmøte for sma

URL: /sma-testdiary.html - Title: Test results for the morphology and lexicon files

URL: /sma.html - Title: South Sámi language model documentation

URL: /sma_lemma.freq.html - Title:

URL: /sma_wf.freq.html - Title:

URL: /src-cg3-disambiguator.cg3.html - Title: S O U T H   S Á M I   D I S A M B I G U A T O R

URL: /src-cg3-valency.cg3.html - Title: S O U T H   S Á M I   V A L E N C Y A N N O T A T O R

URL: /src-fst-morphology-affixes-abbreviations.lexc.html - Title: Continuation lexicons for abbreviations

URL: /src-fst-morphology-affixes-adjectives.lexc.html - Title: Adjective affixes

URL: /src-fst-morphology-affixes-nouns.lexc.html - Title: Nominal inflection sublexica

URL: /src-fst-morphology-affixes-possessive-suffixes.lexc.html - Title:

URL: /src-fst-morphology-affixes-propernouns.lexc.html - Title: Proper nouns morphology

URL: /src-fst-morphology-affixes-symbols.lexc.html - Title: Symbol affixes

URL: /src-fst-morphology-affixes-verbs.lexc.html - Title: South Saami verbal inflection sublexica

URL: /src-fst-morphology-compounding.lexc.html - Title: South Sámi morphological analyser

URL: /src-fst-morphology-phonology.twolc.html - Title: South Sámi morphophonological rule set

URL: /src-fst-morphology-root.lexc.html - Title: South Sámi morphological analyser

URL: /src-fst-morphology-stems-adjectives.lexc.html - Title: Adjective stems

URL: /src-fst-morphology-stems-adverbs.lexc.html - Title:

URL: /src-fst-morphology-stems-nouns.lexc.html - Title: South Sámi nouns

URL: /src-fst-morphology-stems-numerals.lexc.html - Title:

URL: /src-fst-morphology-stems-pronouns.lexc.html - Title: South Saami pronouns

URL: /src-fst-morphology-stems-sma-propernouns.lexc.html - Title:

URL: /src-fst-morphology-stems-verbs.lexc.html - Title: Verb stems

URL: /src-fst-oahpa-filer-aff-adjectives-oahpa.lexc.html - Title: Adjective affixes

URL: /src-fst-oahpa-filer-stems-adjectives-oahpa.lexc.html - Title: Adjective stems

URL: /src-fst-phonetics-txt2ipa.xfscript.html - Title:

URL: /src-fst-transcriptions-transcriptor-abbrevs2text.lexc.html - Title:

URL: /src-fst-transcriptions-transcriptor-symbols2text.lexc.html - Title:

URL: /syntaks-testing.html - Title: Syntaks-testmateriale

URL: /tools-grammarcheckers-grammarchecker.cg3.html - Title:

URL: /tools-grammarcheckers-grc-disambiguator.cg3.html - Title: S O U T H   S Á M I   D I S A M B I G U A T O R

URL: /tools-tokenisers-tokeniser-disamb-gt-desc.pmscript.html - Title: Tokeniser for sma

URL: /tools-tokenisers-tokeniser-gramcheck-gt-desc.pmscript.html - Title: Grammar checker tokenisation for sma

URL: /tools-tokenisers-tokeniser-tts-cggt-desc.pmscript.html - Title: TTS tokenisation for smj

Root items:

URL: /ConvertingToApertium.html - Title: Convertingtoapertium

URL: /KompilereFST.html - Title: Kompilerefst

URL: /Links.html - Title: Links

URL: /adj-meeting-05-2009.html - Title: Stoda no

URL: /docu-sma-adjs.html - Title: Sørsamiske adjektiv, system

URL: /docu-sma-background.html - Title: Background information on the South Saami project

URL: /docu-sma-bugs.html - Title: Bug reports, errors

URL: /docu-sma-deptags.html - Title: South Saami dependency tags

URL: /docu-sma-grammartags.html - Title: Overview

URL: /docu-sma-lex.html - Title: Documenting the South Saami lexicon file

URL: /docu-sma-morphophonology.html - Title: South Saami morphophonological processes

URL: /docu-sma-testplan.html - Title: Test plan for sma

URL: /docu-sma-twol.html - Title: Documentation of South Saami rules

URL: /docu-sma-verbs.html - Title: Souths Saami verb morphology

URL: /gramcheck/ - Title: Grammar checker for South Saami

URL: /index-header.html - Title: South Sámi documentation

URL: / - Title: South Sámi documentation

URL: /lemma.html - Title: Prinsipp for lemmatisering av sørsamisk

URL: /normativity-issues.html - Title: Background

URL: /sma-korpus-innsamling.html - Title: Korpusmøte for sma

URL: /sma-testdiary.html - Title: Test results for the morphology and lexicon files

URL: /sma.html - Title: South Sámi language model documentation

URL: /sma_lemma.freq.html - Title: Sma_lemma.freq

URL: /sma_wf.freq.html - Title: Sma_wf.freq

URL: /src-cg3-disambiguator.cg3.html - Title: S O U T H   S Á M I   D I S A M B I G U A T O R

URL: /src-cg3-valency.cg3.html - Title: S O U T H   S Á M I   V A L E N C Y A N N O T A T O R

URL: /src-fst-morphology-affixes-abbreviations.lexc.html - Title: Continuation lexicons for abbreviations

URL: /src-fst-morphology-affixes-adjectives.lexc.html - Title: Adjective affixes

URL: /src-fst-morphology-affixes-nouns.lexc.html - Title: Nominal inflection sublexica

URL: /src-fst-morphology-affixes-possessive-suffixes.lexc.html - Title: Src-fst-morphology-affixes-possessive-suffixes.lexc

URL: /src-fst-morphology-affixes-propernouns.lexc.html - Title: Proper nouns morphology

URL: /src-fst-morphology-affixes-symbols.lexc.html - Title: Symbol affixes

URL: /src-fst-morphology-affixes-verbs.lexc.html - Title: South Saami verbal inflection sublexica

URL: /src-fst-morphology-compounding.lexc.html - Title: South Sámi morphological analyser

URL: /src-fst-morphology-phonology.twolc.html - Title: South Sámi morphophonological rule set

URL: /src-fst-morphology-root.lexc.html - Title: South Sámi morphological analyser

URL: /src-fst-morphology-stems-adjectives.lexc.html - Title: Adjective stems

URL: /src-fst-morphology-stems-adverbs.lexc.html - Title: Src-fst-morphology-stems-adverbs.lexc

URL: /src-fst-morphology-stems-nouns.lexc.html - Title: South Sámi nouns

URL: /src-fst-morphology-stems-numerals.lexc.html - Title: Src-fst-morphology-stems-numerals.lexc

URL: /src-fst-morphology-stems-pronouns.lexc.html - Title: South Saami pronouns

URL: /src-fst-morphology-stems-sma-propernouns.lexc.html - Title: Src-fst-morphology-stems-sma-propernouns.lexc

URL: /src-fst-morphology-stems-verbs.lexc.html - Title: Verb stems

URL: /src-fst-oahpa-filer-aff-adjectives-oahpa.lexc.html - Title: Adjective affixes

URL: /src-fst-oahpa-filer-stems-adjectives-oahpa.lexc.html - Title: Adjective stems

URL: /src-fst-phonetics-txt2ipa.xfscript.html - Title: Src-fst-phonetics-txt2ipa.xfscript

URL: /src-fst-transcriptions-transcriptor-abbrevs2text.lexc.html - Title: Src-fst-transcriptions-transcriptor-abbrevs2text.lexc

URL: /src-fst-transcriptions-transcriptor-symbols2text.lexc.html - Title: Src-fst-transcriptions-transcriptor-symbols2text.lexc

URL: /syntaks-testing.html - Title: Syntaks-testmateriale

URL: /tools-grammarcheckers-grammarchecker.cg3.html - Title: Tools-grammarcheckers-grammarchecker.cg3

URL: /tools-grammarcheckers-grc-disambiguator.cg3.html - Title: S O U T H   S Á M I   D I S A M B I G U A T O R

URL: /tools-tokenisers-tokeniser-disamb-gt-desc.pmscript.html - Title: Tokeniser for sma

URL: /tools-tokenisers-tokeniser-gramcheck-gt-desc.pmscript.html - Title: Grammar checker tokenisation for sma

URL: /tools-tokenisers-tokeniser-tts-cggt-desc.pmscript.html - Title: TTS tokenisation for smj

Directory items:

URL: /gramcheck/collecting-developer-texts.html - Title: Collecting developer texts