Inari Sámi NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-smn

Page Content

TAGS AND SETS

Tags

This section lists all the tags inherited from the fst, and used as tags in the syntactic analysis. The next section, Sets, contains sets defined on the basis of the tags listed here, those set names are not visible in the output.

Beginning and end of sentence

BOS EOS

Parts of speech tags

N A Adv V Pron CS CC CC-CS Po Pr Pcle Num Interj ABBR ACR CLB LEFT RIGHT WEB PPUNCT PUNCT

COMMA ¶

Tags for POS sub-categories

Pers Dem Interr Indef Recipr Refl Rel Coll NomAg Prop Allegro Arab Romertall

Tags for morphosyntactic properties

Nom Acc Gen Ill Loc Com Ess Ess Sg Du Pl Cmp/SplitR Cmp/SgNom Cmp/SgGen Cmp/SgGen PxSg1 PxSg2 PxSg3 PxDu1 PxDu2 PxDu3 PxPl1 PxPl2 PxPl3 Px

Comp Superl Attr Ord Qst IV TV Prt Prs Ind Pot Cond Imprt ImprtII Sg1 Sg2 Sg3 Du1 Du2 Du3 Pl1 Pl2 Pl3 Inf ConNeg Neg PrfPrc VGen PrsPrc Ger Sup Actio VAbess

Tags for clitic particles

Foc/ge Foc/gen Foc/ges Foc/gis Foc/naj Foc/ba Foc/be Foc/hal Foc/han Foc/bat Foc/son

Derivation tags

Der/PassL Der/PassS Der/NomAg NomAg Der/alla Der/d Der/Car Der/Car Der/lasj Der/NomAct Der/st Der/upmi Der/vuota Der/InchL Der/Dimin Der/Aadv Der/Date Der/Year Der/Hum

Err/Orth Err/Orth-spes ### Semantic tags Sem/Act Sem/Ani Sem/Atr Sem/Body Sem/Clth Sem/Domain Sem/Feat-phys Sem/Fem Sem/Group Sem/Lang Sem/Mal Sem/Measr Sem/Money Sem/Obj Sem/Obj-el Sem/Org Sem/Perc-emo Sem/Plc Sem/Sign Sem/State-sick Sem/Sur Sem/Time Sem/Txt HUMAN PROP-ATTR PROP-SUR TIME-N-SET NOT-TIME TIME-N ### secondary tags ### Syntactic tags @+FAUXV @+FMAINV @-FAUXV @-FMAINV @-FSUBJ> @-F<OBJ @-FOBJ> @-FSPRED<OBJ @-F<ADVL @-FADVL> @-F<SPRED @-F<OPRED @-FSPRED> @-FOPRED> @>ADVL @ADVL< @<ADVL @ADVL> @ADVL @HAB> @<HAB @>N @Interj @N< @>A @P< @>P @HNOUN @INTERJ @>Num @Pron< @>Pron @Num< @OBJ @<OBJ @OBJ> @OPRED @<OPRED @OPRED> @PCLE @COMP-CS< @SPRED @<SPRED @SPRED> @SUBJ @<SUBJ @SUBJ> SUBJ SPRED OPRED @PPRED @APP @APP-N< @APP-Pron< @APP>Pron @APP-Num< @APP-ADVL< @VOC @CVP @CNP OBJ -OTHERS SYN-V @X ## Sets containing sets of lists and tags This part of the file lists a large number of sets based partly upon the tags defined above, and partly upon lexemes drawn from the lexicon. See the sourcefile itself to inspect the sets, what follows here is an overview of the set types. ### Sets for Single-word sets go, and the set INITIAL for initial letters go INITIAL ### Sets for word or not WORD WORD-NOT-de NOT-COMMA ### Derivational affixes DER-V DER-N DER-A1 DER-A A-V A-NOT-V ### Case sets ADLVCASE CASE-HALFAGREEMENT CASE-AGREEMENT CASE NOT-NOM NOT-GEN NOT-ACC ### Verb sets NOT-V ### Sets for finiteness and mood REAL-NEG MOOD-V GC VFIN VFIN-POS VFIN-NOT-IMPRT VFIN-NOT-NEG NOT-PRFPRC ### Sets for person SG1-V SG2-V SG3-V DU1-V DU2-V DU3-V PL1-V PL2-V PL3-V ### Sets consisting of forms of "leđe" (these ones need to be rewritten) ### Pronoun sets ### Adjectival sets and their complements ### Adverbial sets and their complements ### Sets for coordinators ### Sets for adverbs that have lookalikes Here come some adverbs that have identical twins in other POS. If these are found in Adv contexts, we treat them as adverbs. ### Sets of elements with common syntactic behaviour ### Sets for verbs V is all readings with a V tag in them, REAL-V should be the ones without an N tag following the V. The REAL-V set thus awaits a fix to the preprocess V ... N bug. * The set COPULAS is for predicative constructions TRANS-V is the set for verbs really taking objects * Sets for verbs choosing oblique objects or adverbials * **STVLIST** is the list of strictly transitive verbs. In the rules, refer not to STVLIST, but to the set STV defined below. STRICT-TRANS-V is the set for verbs which don't let a GenAcc be a modifier of anything else than an object, e.g. Mun organiseren eatni gievkkanis. - eatni wants to be the object ### Valency sets * **PLACE-V** Those get only not locative if the target is a member TOOL, ABSTR-TOOL or ANIMATE or CONCEPT. Selects more locatives than ONLY-PLACE-LOC-V ### Adverb sets ### Adjective sets ### NP sets defined according to their morphosyntactic features ### The PRE-NP-HEAD family of sets These sets model noun phrases (NPs). The idea is to first define whatever can occur in front of the head of the NP, and thereafter negate that with the expression **WORD - premodifiers**. The set **NOT-NPMOD** is used to find barriers between NPs. Typical usage: ... (*1 N BARRIER NPT-NPMOD) ... meaning: Scan to the first noun, ignoring anything that can be part of the noun phrase of that noun (i.e., "scan to the next NP head") ### Other negatively defined morphosyntactic noun sets ### Noun sets Nominal sets defined according to their morphophonological properties Sets for lexeme homonymy (most of them are moved to where the actual rules are.) The words in the set **N-PO** can be both N and Po, the set takes that into account. ### Nominal sets defined according to their semantical properties * Spatial noun sets. These nouns behave like postpositions * Time sets * Amount sets OKTA * Sets for nouns with morpho-syntactic preferences * Number-related sets * Sets for case, possessive, etc. * Sets for nouns as pred * Sets for animals * Sets for things * Sets for qualities * Sets for things, not necessarily tools * Sets for things such that people can be inside them: * Sets for things such that people cannot be inside them: * Part-whole sets for human * Sets for places * Sets that can both be buildings/places and represent humans * Sets denoting relations ### Miscellaneous sets ### Border sets and their complements ### Syntactic sets *These were the set types.* * * * This (part of) documentation was generated from [tools/grammarcheckers/spellchecker.cg3](https://github.com/giellalt/lang-smn/blob/main/tools/grammarcheckers/spellchecker.cg3)