Lule Sami NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-smj

Overview

Every word is analysed as its dictionary form followed by grammatical tags. Each tag is introduced with a “+” sign. We thus have

bijlav bijlla+N+Sg+Acc
boadáv boahtet+V+Ind+Prs+Sg1
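
As a rough illustration of the format, an analysis string can be taken apart by splitting on the “+” sign. The function name `split_analysis` is hypothetical and not part of the analyser; this is only a minimal sketch that assumes the dictionary form itself contains no “+”.

```python
def split_analysis(analysis):
    """Split 'lemma+Tag1+Tag2...' into the dictionary form and its tags.

    Assumption: the dictionary form contains no '+' character.
    """
    lemma, *tags = analysis.split("+")
    return lemma, tags


lemma, tags = split_analysis("boahtet+V+Ind+Prs+Sg1")
print(lemma, tags)  # boahtet ['V', 'Ind', 'Prs', 'Sg1']
```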

The tag scheme follows Spiik and Nystø/Johnsen. It does not differ much from the North Sámi scheme; the main differences are due to the grammatical differences between the two languages.

Note that the part of speech tag has to be given twice when a derivational tag is used, once before the derivational tag and once after it:

gåhtjoduvvá gåhttjot+V+Der/Pass+V+Ind+Prs+Sg3
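
The repeated part of speech can be made visible with a small sketch that lists the POS tags and the derivational tags of an analysis in order. `POS_TAGS` only contains the POS tags that appear in the examples on this page, and the function names are hypothetical; treating every tag that starts with `Der/` as derivational is an assumption based on `+Der/Pass`.

```python
# Only the POS tags that occur in the examples on this page (an incomplete set).
POS_TAGS = {"N", "A", "V", "Pron", "Pr", "Po"}


def pos_sequence(analysis):
    """Return the POS tags of an analysis in the order they occur."""
    _lemma, *tags = analysis.split("+")
    return [tag for tag in tags if tag in POS_TAGS]


def derivation_tags(analysis):
    """Return the tags that start with 'Der/' (assumed to mark derivation)."""
    _lemma, *tags = analysis.split("+")
    return [tag for tag in tags if tag.startswith("Der/")]


analysis = "gåhttjot+V+Der/Pass+V+Ind+Prs+Sg3"
print(pos_sequence(analysis))     # ['V', 'V']
print(derivation_tags(analysis))  # ['Der/Pass']
```

Under this reading, the last element of `pos_sequence` is the part of speech of the derived word.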

The main parts of speech (POS) for Lule Sami are:

Nouns, adjectives, verbs, pronouns, adverbs, particles, subjunctions, conjunctions, postpositions, prepositions and interjections.

The nouns

Here is a list of the nominal tags, given in the order in which they appear in the analysis. Note that the grammatical categories in parentheses can be omitted.

   
Part of speech: +N
Number: +Sg, +Pl
Case: +Ess +Nom +Gen +Acc +Ill +Ine +Ela +Com
(Possessive suffix:) +PxSg1 +PxSg2 +PxSg3 +PxDu1 +PxDu2 +PxDu3 +PxPl1 +PxPl2 +PxPl3
(Clitic:) +Foc
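
The ordering above can be sketched as a small helper that assembles a noun analysis from its parts. This is an illustration only: `noun_analysis` is a hypothetical name, the POS tag is assumed to be `+N` as in the example `bijlla+N+Sg+Acc`, and the clitic is passed through unchecked because the full clitic list is not given here.

```python
NUMBERS = ("Sg", "Pl")
CASES = ("Ess", "Nom", "Gen", "Acc", "Ill", "Ine", "Ela", "Com")
# PxSg1 ... PxPl3, built in the same order as in the list above.
POSSESSIVES = tuple(f"Px{num}{person}"
                    for num in ("Sg", "Du", "Pl")
                    for person in (1, 2, 3))


def noun_analysis(lemma, number, case, px=None, clitic=None):
    """Assemble lemma+N+Number+Case(+Px)(+Clitic) in the documented order."""
    if number not in NUMBERS:
        raise ValueError(f"unknown number tag: {number}")
    if case not in CASES:
        raise ValueError(f"unknown case tag: {case}")
    if px is not None and px not in POSSESSIVES:
        raise ValueError(f"unknown possessive suffix tag: {px}")
    tags = ["N", number, case]
    if px is not None:
        tags.append(px)
    if clitic is not None:
        tags.append(clitic)  # not validated: the clitic list is not given here
    return "+".join([lemma, *tags])


print(noun_analysis("bijlla", "Sg", "Acc"))  # bijlla+N+Sg+Acc
```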

The adjectives

Used non-attributively, the adjective takes tags resembling those of the noun:

   
Part of speech: +A
(Grade:) +Comp, +Superl
Number: +Sg, +Pl
Case: +Ess +Nom +Gen +Acc +Ill +Ela +Ine +Com
(Clitic:) e.g. +Foc/ (see list)

Used attributively, the adjective has a much simpler tag scheme:

   
Part of speech: +A
Attribute: +Attr
(Clitic:) e.g. +Foc (see list)

The verbs

Finite and non-finite verb forms have quite distinct paradigms. Finite forms first:

   
Part of speech: +V
(Derivation:) +Der/Pass
Mood: +Ind, +Pot, +Cond, +Imprt
Tense: +Prs, +Prt
Person/Number: +Sg1 +Sg2 +Sg3 +Du1 +Du2 +Du3 +Pl1 +Pl2 +Pl3, +ConNeg
(Clitic:) e.g. +Foc (see list)

Non-finite verb forms:

   
Part of speech: +V
(Derivation:) +Der/Pass
Nominal verb form: +Inf, +Actio, +Ger, +PrsPrc, +PrfPrc, +VGen, +VAbess
(Clitic:) e.g. +Foc (see list)
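
Since the two verb paradigms use disjoint tag sets, a verb analysis can be classified by checking which set it draws on. The following is a sketch under that assumption: `verb_form_kind` is a hypothetical helper, and `boahtet+V+Inf` is an analysis composed from the lists above for illustration, not analyser output.

```python
MOODS = {"Ind", "Pot", "Cond", "Imprt"}
NOMINAL_FORMS = {"Inf", "Actio", "Ger", "PrsPrc", "PrfPrc", "VGen", "VAbess"}


def verb_form_kind(analysis):
    """Classify a verb analysis by the tag sets listed above."""
    _lemma, *tags = analysis.split("+")
    if "V" not in tags:
        return "not a verb"
    if any(tag in MOODS for tag in tags):
        return "finite"
    if any(tag in NOMINAL_FORMS for tag in tags):
        return "non-finite"
    return "unknown"


print(verb_form_kind("boahtet+V+Ind+Prs+Sg1"))  # finite
print(verb_form_kind("boahtet+V+Inf"))          # non-finite
```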

Other derived verb forms:

   
Part of speech (base): +V
Derivation: +Actor
Part of speech (derived): +N
Number: +Sg, +Pl
Case: +Ess +Nom +Gen +Acc +Ill +Ela +Ine +Com
(Possessive suffix:) +PxSg1 +PxSg2 +PxSg3 +PxDu1 +PxDu2 +PxDu3 +PxPl1 +PxPl2 +PxPl3

Here is an example: “åhpadiddje” > åhpadit+V+TV+Der3+Actor+N+Pl+Nom

These tags are not yet documented: +Actio +Neg +ImprtII

The pronouns

The personal, demonstrative and interrogative pronouns:

baseform+Pron+pronoun_type+NumberPerson+Case

Here pronoun_type is, for example, +Pers or +Dem.

Examples:

underlying form: mon+Pron+Pers+Sg1+Com, surface form: mujna
underlying form: dat+Pron+Dem+Pl+Nom, surface form: da

The reflexive pronouns:

baseform+Pron+pronoun_type+Case(+possessive suffix)

Example: underlying form: iesj+Pron+Refl+Ine+PxDu1, surface form: ietjanimme
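
The pronoun templates can be illustrated with a short sketch that sorts the tags of a pronoun analysis into named fields. `parse_pronoun` is a hypothetical helper; it assumes that the tag after `+Pron` is the pronoun type, that possessive suffix tags start with `Px`, and that any remaining tag other than a case tag is the number/person tag.

```python
CASES = {"Ess", "Nom", "Gen", "Acc", "Ill", "Ine", "Ela", "Com"}


def parse_pronoun(analysis):
    """Sort the tags of a pronoun analysis into named fields."""
    lemma, *tags = analysis.split("+")
    if len(tags) < 2 or tags[0] != "Pron":
        raise ValueError(f"not a pronoun analysis: {analysis}")
    fields = {"lemma": lemma, "type": tags[1],
              "number_person": None, "case": None, "possessive": None}
    for tag in tags[2:]:
        if tag in CASES:
            fields["case"] = tag
        elif tag.startswith("Px"):
            fields["possessive"] = tag
        else:
            # Assumption: anything else is the number/person tag.
            fields["number_person"] = tag
    return fields


print(parse_pronoun("mon+Pron+Pers+Sg1+Com"))
print(parse_pronoun("iesj+Pron+Refl+Ine+PxDu1"))
```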

The indeclinable words

These have their POS tag as their only tag:

underlying form: birra+Pr or birra+Po
surface form:    birra

Alphabetical list of the tags

Part of speech

Sub-part of speech

Grammatical properties

Derivational suffix tags

Semantic tags

These are tags used for classifying names, e.g. +Prop+Sem/Fem
