South Sámi NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-sma

S O U T H   S Á M I   D I S A M B I G U A T O R

Delimiters, tags and sets

"<.>" "<!>" "<?>" "<...>" "<¶>" sent

Tags

BOS/EOS:

Morphological tags

Derivation tags

Error usage tags

We define two lists for Err/xxx tags:

Other tags

Semantic tags

Secondary tags

Syntactic tags

Titles

REAL-TITLE OFFICE TITLE

Sets

Sets of morphological tags for syntactic use

CASES ADVLCASE NUMBER

Noun sets

INSTITUTION ORGANIZATION EDUCATION CURRENCY CURRENCY LESSON

Verb sets

REALCOPULAS

COPULAS

V-NOT-COP

MOD-ASP

Adjective sets

Adverb sets

GUKTIEGOSSE

DAESTIE

ILLADV

INEADV1

ELAADV1

INEADV

ELAADV

DV-MOD-ADV

Postposition sets

ILLPO

BOUNDARY SETS

REALCLB

SV-BOUNDARY

NP-BOUNDARY

Derivation sets

V-DER

V-DER-SUF

N-DER N-DER-SUF

A-DER A-DER-SUF

PASS

LEX-V LEX-N LEX-A LEX-ADV

VERB-FORMS 2-PERS

Disambiguation rules

BEFORE-SECTIONS

Rule for adding Sem/Date as a tag to readings which looks like dates (fjernes når vi får felles numeralfil fra shared)

Guessing: Rule for adding Adv Sem/Adr as a tag to readings which looks addresses

Guessing: Rule for adding Adv Sem/Adr as a tag to readings which looks addresses

Rules for adding to verbs denoting verbal actions like: ... jeahta Aili Kestkitalo.

SECTION

Cycle 0 (Early rules)

Removing non-lexicalised forms when lexicalised

Numerals and ACR

Numerals in QPs

CC og not (spesifikke regler lenger ned)

Interj

Possessive suffix

REmove Px if not family

Pronouns

Proper nouns

INITIAL

Verbs

Postpositions

Selecting postpositions when preceded by genitives, etc.

Particles and adverbs

Adjective or Indef

Demonstratives

Genitive

Adjective or not

Rel or Interr OR Indef

Adverbs

Selecting adverbs in local contexts

Verbs

Selecting verbs in local contexts, based upon agreement patterns

Selecting imperative sentence-initially with appropriate right context

Remove verb readings

Select Inf

Mapping rules

CC- and CS-Mapping

CNP mapping

Mapping CNP to CC and CS.

CVP Mapping

Mapping @CVP to all CS

Attributes or not

PrfPrc

Select PrfPrc if DerNomAct

Mapping verbs

killifVinCohort

This rule removes all other readings, if there is a mapped V reading in the same cohort. Every case which this goes wrong, should be fixed in mapping rules or previous disrules.

Person

leah Prs Sg2 = Pl3

Select Inf If Infv

Span sentences

Nomen

Remove Prop Attr if not 1 Prop

Verb or Noun

CC and CS or Adv

Adj or Adv

Grammatisk ord eller N eller A

N or V

Ger or Der/NomAct

Adj or Indef

Num

Adv or Po/Pr

Illative or genetive

Essive

Comitative

Accusative or illative

Indef or Adv

special lemmas

Adverb context prefers Adv

Verb person vs. Inf – moved here in order to have the pronouns disambiguated first.

Proper nouns

Rule set taken from sme

gellie as numeral, not pronoun


This (part of) documentation was generated from src/cg3/disambiguator.cg3

Sitemap

Debugging site.pages:

URL: /assets/css/style.css - Title:

URL: /ConvertingToApertium.html - Title:

URL: /KompilereFST.html - Title:

URL: /Links.html - Title:

URL: /adj-meeting-05-2009.html - Title: Stoda no

URL: /docu-sma-adjs.html - Title: Sørsamiske adjektiv, system

URL: /docu-sma-background.html - Title: Background information on the South Saami project

URL: /docu-sma-bugs.html - Title: Bug reports, errors

URL: /docu-sma-deptags.html - Title: South Saami dependency tags

URL: /docu-sma-grammartags.html - Title: Overview

URL: /docu-sma-lex.html - Title: Documenting the South Saami lexicon file

URL: /docu-sma-morphophonology.html - Title: South Saami morphophonological processes

URL: /docu-sma-testplan.html - Title: Test plan for sma

URL: /docu-sma-twol.html - Title: Documentation of South Saami rules

URL: /docu-sma-verbs.html - Title: Souths Saami verb morphology

URL: /gramcheck/collecting-developer-texts.html - Title: Collecting developer texts

URL: /gramcheck/ - Title: Grammar checker for South Saami

URL: /index-header.html - Title: South Sámi documentation

URL: / - Title: South Sámi documentation

URL: /lemma.html - Title: Prinsipp for lemmatisering av sørsamisk

URL: /normativity-issues.html - Title: Background

URL: /sma-korpus-innsamling.html - Title: Korpusmøte for sma

URL: /sma-testdiary.html - Title: Test results for the morphology and lexicon files

URL: /sma.html - Title: South Sámi language model documentation

URL: /sma_lemma.freq.html - Title:

URL: /sma_wf.freq.html - Title:

URL: /src-cg3-disambiguator.cg3.html - Title: S O U T H   S Á M I   D I S A M B I G U A T O R

URL: /src-cg3-valency.cg3.html - Title: S O U T H   S Á M I   V A L E N C Y A N N O T A T O R

URL: /src-fst-morphology-affixes-abbreviations.lexc.html - Title: Continuation lexicons for abbreviations

URL: /src-fst-morphology-affixes-adjectives.lexc.html - Title: Adjective affixes

URL: /src-fst-morphology-affixes-nouns.lexc.html - Title: Nominal inflection sublexica

URL: /src-fst-morphology-affixes-possessive-suffixes.lexc.html - Title:

URL: /src-fst-morphology-affixes-propernouns.lexc.html - Title: Proper nouns morphology

URL: /src-fst-morphology-affixes-symbols.lexc.html - Title: Symbol affixes

URL: /src-fst-morphology-affixes-verbs.lexc.html - Title: South Saami verbal inflection sublexica

URL: /src-fst-morphology-compounding.lexc.html - Title: South Sámi morphological analyser

URL: /src-fst-morphology-phonology.twolc.html - Title: South Sámi morphophonological rule set

URL: /src-fst-morphology-root.lexc.html - Title: South Sámi morphological analyser

URL: /src-fst-morphology-stems-adjectives.lexc.html - Title: Adjective stems

URL: /src-fst-morphology-stems-adverbs.lexc.html - Title:

URL: /src-fst-morphology-stems-nouns.lexc.html - Title: South Sámi nouns

URL: /src-fst-morphology-stems-numerals.lexc.html - Title:

URL: /src-fst-morphology-stems-pronouns.lexc.html - Title: South Saami pronouns

URL: /src-fst-morphology-stems-sma-propernouns.lexc.html - Title:

URL: /src-fst-morphology-stems-verbs.lexc.html - Title: Verb stems

URL: /src-fst-oahpa-filer-aff-adjectives-oahpa.lexc.html - Title: Adjective affixes

URL: /src-fst-oahpa-filer-stems-adjectives-oahpa.lexc.html - Title: Adjective stems

URL: /src-fst-phonetics-txt2ipa.xfscript.html - Title:

URL: /src-fst-transcriptions-transcriptor-abbrevs2text.lexc.html - Title:

URL: /src-fst-transcriptions-transcriptor-symbols2text.lexc.html - Title:

URL: /syntaks-testing.html - Title: Syntaks-testmateriale

URL: /tools-grammarcheckers-grammarchecker.cg3.html - Title:

URL: /tools-grammarcheckers-grc-disambiguator.cg3.html - Title: S O U T H   S Á M I   D I S A M B I G U A T O R

URL: /tools-tokenisers-tokeniser-disamb-gt-desc.pmscript.html - Title: Tokeniser for sma

URL: /tools-tokenisers-tokeniser-gramcheck-gt-desc.pmscript.html - Title: Grammar checker tokenisation for sma

URL: /tools-tokenisers-tokeniser-tts-cggt-desc.pmscript.html - Title: TTS tokenisation for smj

Root items:

URL: /ConvertingToApertium.html - Title: Convertingtoapertium

URL: /KompilereFST.html - Title: Kompilerefst

URL: /Links.html - Title: Links

URL: /adj-meeting-05-2009.html - Title: Stoda no

URL: /docu-sma-adjs.html - Title: Sørsamiske adjektiv, system

URL: /docu-sma-background.html - Title: Background information on the South Saami project

URL: /docu-sma-bugs.html - Title: Bug reports, errors

URL: /docu-sma-deptags.html - Title: South Saami dependency tags

URL: /docu-sma-grammartags.html - Title: Overview

URL: /docu-sma-lex.html - Title: Documenting the South Saami lexicon file

URL: /docu-sma-morphophonology.html - Title: South Saami morphophonological processes

URL: /docu-sma-testplan.html - Title: Test plan for sma

URL: /docu-sma-twol.html - Title: Documentation of South Saami rules

URL: /docu-sma-verbs.html - Title: Souths Saami verb morphology

URL: /gramcheck/ - Title: Grammar checker for South Saami

URL: /index-header.html - Title: South Sámi documentation

URL: / - Title: South Sámi documentation

URL: /lemma.html - Title: Prinsipp for lemmatisering av sørsamisk

URL: /normativity-issues.html - Title: Background

URL: /sma-korpus-innsamling.html - Title: Korpusmøte for sma

URL: /sma-testdiary.html - Title: Test results for the morphology and lexicon files

URL: /sma.html - Title: South Sámi language model documentation

URL: /sma_lemma.freq.html - Title: Sma_lemma.freq

URL: /sma_wf.freq.html - Title: Sma_wf.freq

URL: /src-cg3-disambiguator.cg3.html - Title: S O U T H   S Á M I   D I S A M B I G U A T O R

URL: /src-cg3-valency.cg3.html - Title: S O U T H   S Á M I   V A L E N C Y A N N O T A T O R

URL: /src-fst-morphology-affixes-abbreviations.lexc.html - Title: Continuation lexicons for abbreviations

URL: /src-fst-morphology-affixes-adjectives.lexc.html - Title: Adjective affixes

URL: /src-fst-morphology-affixes-nouns.lexc.html - Title: Nominal inflection sublexica

URL: /src-fst-morphology-affixes-possessive-suffixes.lexc.html - Title: Src-fst-morphology-affixes-possessive-suffixes.lexc

URL: /src-fst-morphology-affixes-propernouns.lexc.html - Title: Proper nouns morphology

URL: /src-fst-morphology-affixes-symbols.lexc.html - Title: Symbol affixes

URL: /src-fst-morphology-affixes-verbs.lexc.html - Title: South Saami verbal inflection sublexica

URL: /src-fst-morphology-compounding.lexc.html - Title: South Sámi morphological analyser

URL: /src-fst-morphology-phonology.twolc.html - Title: South Sámi morphophonological rule set

URL: /src-fst-morphology-root.lexc.html - Title: South Sámi morphological analyser

URL: /src-fst-morphology-stems-adjectives.lexc.html - Title: Adjective stems

URL: /src-fst-morphology-stems-adverbs.lexc.html - Title: Src-fst-morphology-stems-adverbs.lexc

URL: /src-fst-morphology-stems-nouns.lexc.html - Title: South Sámi nouns

URL: /src-fst-morphology-stems-numerals.lexc.html - Title: Src-fst-morphology-stems-numerals.lexc

URL: /src-fst-morphology-stems-pronouns.lexc.html - Title: South Saami pronouns

URL: /src-fst-morphology-stems-sma-propernouns.lexc.html - Title: Src-fst-morphology-stems-sma-propernouns.lexc

URL: /src-fst-morphology-stems-verbs.lexc.html - Title: Verb stems

URL: /src-fst-oahpa-filer-aff-adjectives-oahpa.lexc.html - Title: Adjective affixes

URL: /src-fst-oahpa-filer-stems-adjectives-oahpa.lexc.html - Title: Adjective stems

URL: /src-fst-phonetics-txt2ipa.xfscript.html - Title: Src-fst-phonetics-txt2ipa.xfscript

URL: /src-fst-transcriptions-transcriptor-abbrevs2text.lexc.html - Title: Src-fst-transcriptions-transcriptor-abbrevs2text.lexc

URL: /src-fst-transcriptions-transcriptor-symbols2text.lexc.html - Title: Src-fst-transcriptions-transcriptor-symbols2text.lexc

URL: /syntaks-testing.html - Title: Syntaks-testmateriale

URL: /tools-grammarcheckers-grammarchecker.cg3.html - Title: Tools-grammarcheckers-grammarchecker.cg3

URL: /tools-grammarcheckers-grc-disambiguator.cg3.html - Title: S O U T H   S Á M I   D I S A M B I G U A T O R

URL: /tools-tokenisers-tokeniser-disamb-gt-desc.pmscript.html - Title: Tokeniser for sma

URL: /tools-tokenisers-tokeniser-gramcheck-gt-desc.pmscript.html - Title: Grammar checker tokenisation for sma

URL: /tools-tokenisers-tokeniser-tts-cggt-desc.pmscript.html - Title: TTS tokenisation for smj

Directory items:

URL: /gramcheck/collecting-developer-texts.html - Title: Collecting developer texts