SOUTH SÁMI DISAMBIGUATOR
Delimiters, tags and sets
"<.>" "<!>" "<?>" "<...>" "<¶>" sent
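In CG-3 source, a delimiter inventory like the one above is normally given in a DELIMITERS statement. A minimal sketch, assuming standard CG-3 syntax and that `sent` is a tag carried by sentence-final cohorts:

```cg3
# Sketch only: a disambiguation window ends at any cohort that matches
# one of these word forms or carries the tag "sent".
DELIMITERS = "<.>" "<!>" "<?>" "<...>" "<¶>" sent ;
```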
Tags
BOS/EOS:
(>>>) (<s>) (<<<) (</s>)
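The beginning-of-sentence and end-of-sentence tags are plausibly grouped into two sets. A sketch, assuming the set names BOS and EOS from the heading above:

```cg3
# Sketch only: BOS matches the start-of-window markers, EOS the end markers.
LIST BOS = (>>>) (<s>) ;
LIST EOS = (<<<) (</s>) ;
```

Rules can then refer to sentence edges by name, for example with a context such as `(-1 BOS)`.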
Morphological tags
- N
- Sg Pl
- Nom Acc Gen Ine Ela Ill Com Ess
- PxSg1 PxSg2 PxSg3 PxPl1 PxPl2 PxPl3
- Sg1 Sg2 Sg3 Pl1 Pl2 Pl3
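Each morphological tag is typically exposed to the rules as a one-member LIST of the same name. A short sketch for a few of the tags above, assuming that convention is followed here:

```cg3
# Sketch only: one-member lists, so rules can refer to tags by set name.
LIST N = N ;
LIST Sg = Sg ;
LIST Pl = Pl ;
LIST Nom = Nom ;
LIST Gen = Gen ;
LIST PxSg1 = PxSg1 ;
LIST Sg1 = Sg1 ;
```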
Derivation tags
Der/A Der/Car Der/Dimin Der/InchL Der/NomAct Der/NomAg Der/PassL Der/PassS Der/Rec Der/adte Der/ahtje Der/alla Der/d Der/eds Der/ht Der/htalle Der/htj Der/ihks Der/ijes Der/l Der/laakan Der/ldahke Der/ldh Der/ldihkie Der/les Der/lg Der/st Der/vuota
Error usage tags
We define two lists for Err/xxx tags:
Err/Orth: Err/Orth Err/Orth-a/á Err/Orth-nom/gen Err/Orth-nom/acc Err/DerSub Err/CmpSub Err/UnspaceCmp Err/HyphSub Err/SpaceCmp Err/Spellrelax err_orth_mt
Err/Orth-spes: Err/Orth-a/á Err/Orth-nom/gen Err/Orth-nom/acc Err/DerSub Err/CmpSub Err/UnspaceCmp Err/HyphSub Err/SpaceCmp Err/Spellrelax err_orth_a_á_mt err_orth_nom_acc_mt err_orth_nom_gen_mt
Other tags
Cmp/Hyph <vdic>
Other secondary tags
Semantic tags
Secondary tags
Syntactic tags
- @CNP @CVP @+FAUXV @+FMAINV @-FAUXV @-FMAINV MAINV
Titles
REAL-TITLE OFFICE TITLE
Sets
Sets of morphological tags for syntactic use
CASES ADVLCASE NUMBER
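Sets such as these collect several tags under one name. A sketch of how NUMBER and CASES could be defined; the membership is assumed from the tag inventory above and is not taken from the grammar file:

```cg3
# Sketch only: assumed membership, based on the morphological tags listed earlier.
LIST NUMBER = Sg Pl ;
LIST CASES = Nom Acc Gen Ine Ela Ill Com Ess ;
```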
Noun sets
INSTITUTION ORGANIZATION EDUCATION CURRENCY LESSON
Verb sets
REALCOPULAS
COPULAS
V-NOT-COP
MOD-ASP
Adjective sets
Adverb sets
GUKTIEGOSSE
DAESTIE
ILLADV
INEADV1
ELAADV1
INEADV
ELAADV
DV-MOD-ADV
Postposition sets
ILLPO
Boundary sets
REALCLB
SV-BOUNDARY
NP-BOUNDARY
Derivation sets
V-DER
V-DER-SUF
N-DER N-DER-SUF
A-DER A-DER-SUF
PASS
LEX-V LEX-N LEX-A LEX-ADV
VERB-FORMS 2-PERS
Disambiguation rules
BEFORE-SECTIONS
Rule for adding Sem/Date as a tag to readings that look like dates (to be removed once we get a common numeral file from shared)
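A rule of this kind can be sketched with ADD and a regular-expression word-form tag. The set name DATE-FORM and the date pattern (digits separated by full stops, as in 12.03.2021) are assumptions for illustration, not the pattern used in the grammar:

```cg3
# Hypothetical set: word forms made of three digit groups separated by full stops.
LIST DATE-FORM = "<[0-9]+[.][0-9]+[.][0-9]+>"r ;

BEFORE-SECTIONS

# Add Sem/Date to every reading of a cohort whose word form looks like a date.
ADD (Sem/Date) TARGET DATE-FORM ;
```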
Guessing: Rule for adding Adv Sem/Adr as a tag to readings that look like addresses
Rules for adding
SECTION
Cycle 0 (Early rules)
Removing non-lexicalised forms when lexicalised
Numerals and ACR
Numerals in QPs
CC and not (specific rules further down)
Interj
Possessive suffix
Remove Px if not family
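The idea can be sketched as a REMOVE rule with a negated context. The set names Px and FAMILY and the two kinship lemmas are illustrative assumptions; the actual set is not shown in this documentation:

```cg3
# Assumed set of possessive suffix tags.
LIST Px = PxSg1 PxSg2 PxSg3 PxPl1 PxPl2 PxPl3 ;
# Illustrative kinship lemmas only (hypothetical membership).
LIST FAMILY = "aehtjie" "tjidtjie" ;

SECTION

# Drop possessive-suffix readings unless the cohort has a kinship-term reading.
REMOVE Px IF (NOT 0 FAMILY) ;
```

As with any REMOVE rule, the last remaining reading of a cohort is never removed.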
Pronouns
Proper nouns
INITIAL
Verbs
Postpositions
Selecting postpositions when preceded by genitives, etc.
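A minimal sketch of such a rule, assuming one-member sets Po and Gen; the actual rules probably carry further conditions:

```cg3
LIST Po = Po ;
LIST Gen = Gen ;

SECTION

# Prefer the postposition reading when the cohort to the left can be genitive.
SELECT Po IF (-1 Gen) ;
```

A stricter variant could use `-1C Gen`, which requires every reading of the preceding cohort to be genitive.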
Particles and adverbs
Adjective or Indef
Demonstratives
Genitive
Adjective or not
Rel or Interr OR Indef
Adverbs
Selecting adverbs in local contexts
Verbs
Selecting verbs in local contexts, based upon agreement patterns
Selecting imperative sentence-initially with appropriate right context
Remove verb readings
Select Inf
Mapping rules
CC- and CS-Mapping
- COMPCS @COMP-CS< to Adv or A after goh etc.
CNP mapping
Mapping CNP to CC and CS.
CVP Mapping
Mapping @CVP to all CS
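In CG-3 terms this could look like the sketch below, assuming a one-member set CS:

```cg3
LIST CS = CS ;

SECTION

# Attach the syntactic tag @CVP to every CS reading that is not yet mapped.
MAP (@CVP) TARGET CS ;
```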
Attributes or not
PrfPrc
Select PrfPrc if DerNomAct
Mapping verbs
killifVinCohort
This rule removes all other readings if there is a mapped V reading in the same cohort. Every case in which this goes wrong should be fixed in the mapping rules or in earlier disambiguation rules.
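One way to express this behaviour is a named SELECT rule. The helper set V-MAPPED is hypothetical, and the actual rule may well be written as a REMOVE instead:

```cg3
# Hypothetical helper set: verb readings carrying one of the verbal syntactic tags.
LIST V-MAPPED = (V @+FAUXV) (V @+FMAINV) (V @-FAUXV) (V @-FMAINV) ;

SECTION

# Keep only the mapped verb readings; the rule does nothing when none is present.
SELECT:killifVinCohort V-MAPPED ;
```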
Person
leah Prs Sg2 = Pl3
Select Inf If Infv
Span sentences
Nominals
Remove Prop Attr if not 1 Prop
Verb or Noun
CC and CS or Adv
Adj or Adv
Grammatical word or N or A
N or V
Ger or Der/NomAct
Adj or Indef
Num
Adv or Po/Pr
Illative or genitive
Essive
Comitative
Accusative or illative
Indef or Adv
Special lemmas
Adverb context prefers Adv
Verb person vs. Inf – moved here in order to have the pronouns disambiguated first.
Proper nouns
Rule set taken from sme (North Sámi)
gellie as numeral, not pronoun
This (part of) documentation was generated from tools/grammarcheckers/grc-disambiguator.cg3