Lule Sami NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-smj

Lule Sáme Proper noun morphology !

Even syllable proper nouns

Unstressed last syllable

Words in ACCRA lexicons end on vowel, have no CG and get “even-syllable” case marking where case suffixes are added directly. Illative e:i, but not o:u. Last syllable is unstressed. Both non-assimilated and assmilated stems (although not all are fully, or correctly, assmilated)

LEXICON ACCRA-ani Vowel-final names where case endings are added directly, no cg. Illative e changes to i. Animales.

LEXICON ACCRA-obj Vowel-final names where case endings are added directly, no cg. Object names

LEXICON ACCRA-org Vowel-final names where caseendings are added directly, no cg. organizations

LEXICON ACCRA-mal Vowel-final names where case are added directly, no cg. Male names

LEXICON ACCRA-fem Vowel-final names where case endings are added directly, no cg. Female names

LEXICON ACCRA-femsur Vowel-final names where case endings are added directly, no cg. Female names also used as surnames

LEXICON ACCRA-malfem Vowel-final names where case endings are added directly, no cg. Names that can be both female and male names

LEXICON ACCRA-objplc Vowel-final names where case endings are added directly, no cg. Names that can be both objects and place names

LEXICON ACCRA-femplc Vowel-final names where case endings are added directly, no cg. Names that can be both female and place names

LEXICON ACCRA-sur Vowel-final names where case endings are added directly, no cg. Surnames

LEXICON ACCRA-malsur Vowel-final names where case endings are added directly, no cg. Names that can be both male- and surnames

LEXICON ACCRA-plc Vowel-final names where caseendings are added directly, no cg. Place names

LEXICON ACCRA_MWE-plc Vowel-final names where caseendings are added directly, no cg. Place names

LEXICON GIRUNA-plc For proper Kiruna. Same as ACCRA. Different lexicon because of sma.

LEXICON ACCRA-LOAN-org Only nominatives. Vowel-final names where case endings are added directly, no cg. organizations

LEXICON ACCRA-LOAN-obj Only nominatives. Vowel-final names where case endings are added directly, no cg. Object names

LEXICON ACCRA-LOAN-plc Only nominatives. Vowel-final names where case endings are added directly, no cg.Place names

In smj RONDANE is same as ACCRA, in use in smi because of diffrences in sme. No -lasj or -k. Last syllable is unstressed. Non-assimilated-stems.

LEXICON RONDANE-plc E-final names, with no cg. elative -s, ill -ij. Place names

LEXICON RONDANE-SG-plc E-final names, with no cg. elative -s, ill -ij. Place names

LEXICON RONDANE-LOAN Only nominative.Place names

LEXICON RONDANE-SG-LOAN Only nominative. Place names

LEXICON RONDANE-sur Surnames

LEXICON RONDANE-obj Objects

LEXICON RONDANE-org Organizations

LEXICON RONDANE-mal Male names

LEXICON RONDANE-fem Female names

These sublexica are irrelevant for ACCRA, but added for the sake of the lexicon MARJA

GATA are Norwegian place names that end on -gata. Gets even-syllable casemarking. Last syllable is unstressed. Non-assimilated stems.

LEXICON GATA-plc Norwegian place names that end on -gata. Gets even-syllable casemarking. Last syllable is unstressed.

Words in MARJA end on vowel, with CG, even-syllable case marking. Illative change e to á, illative i stays i. Last syllable is unstressed. Real lule sami stems.

LEXICON MARJA-fem Odd-syllable with cg. Female names

LEXICON MARJA-ani Animal names

LEXICON MARJA-mal Male names

LEXICON MARJA-obj Objects

LEXICON MARJA-org Organizations

LEXICON MARJA-plc Vowel final names with Gradation and Ill change (place names)

LEXICON MARJA-sur Surnames

LEXICON MARJA-plc-der = place name derivations and corresponding flag. Presently not used in SMJ.

LEXICON SUOBMA-plc Placenames. Like MARJA but no derivation

LEXICON SUOBMA-org Placenames. Like MARJA but no derivation

Stressed last syllable

These proper nouns are in essence partly assimilated loan word as foreign words with stressed last syllable are assimilated to sami by (often adapting the stressed syllable vowel, and) adding an unstressed syllable consisting of adapted (or if necesarry added) consonants and ending on vowel a (Morén-Duollja 2014). Proper nouns are only partly assimilated in that the stressed syllable vowel is not adapted in any way, neither are consonats inserted, only the final “a” remains. These proper nouns therefore work like regular a-stem nouns and get an even syllable case marking.

Words in lexicon NYSTØ end on vowel, no cg. Non-assimilated stems

LEXICON NYSTØ-fem Femal names

LEXICON NYSTØ-mal Male name

LEXICON NYSTØ-obj Objects

LEXICON NYSTØ-org Organizations

LEXICON NYSTØ-LOAN-org Organizations loan

LEXICON NYSTØ-sur Sur names

LEXICON NYSTØ-LOAN-plc Place names loan

LEXICON NYSTØ-plc Place names

LEXICON NYSTØ_MWE-plc Place names

Words in DUBAI lexicon end on vowel+vowel and have no cg. Last syllable is stressed. Get even syllable case marking. Non-assimilated stems. Not sure if this lexicon is necessary, at least for smj’s sake.

LEXICON DUBAI-fem I-final names. No cg. Female names

LEXICON DUBAI-obj I-final names. No cg. Object names

LEXICON DUBAI-org Organizations

LEXICON DUBAI-mal Male names

LEXICON DUBAI-sur Surnames

LEXICON DUBAI-plc Place names

Words in lexicon BERN end on conconant, no cg, even syllable case marking with -av, -aj, -as, etc. Last syllable is stressed. Both assimilated and non-assmilated stems.

LEXICON BERN-ani Animals

LEXICON BERN-mal Male names

LEXICON BERN-surmal name that are both sur- and male names

LEXICON BERN-fem Female name

Different lexicon for female persons. Audhild.

LEXICON BERN-sur Surnames

LEXICON BERN-plc Placenames

LEXICON BERN_MWE-plc Placenames

LEXICON BERN-objsur Names used as both objects and surnames.

LEXICON BERN-orgsur Names used for both organizations and surnames.

LEXICON BERN-obj Objects. Obs: Different lexicon for organisations. Microsoft.

LEXICON BERN-org Organizations

LEXICON BERN-LOAN-org Organizations loan.

LEXICON BERN-LOAN-plc Placenames loan.

LEXICON BERN-LOAN-obj Objects loan.

Different lexicon for names that are both surnames and places.

Lexicons OY work as BERN lexicons

Words in LONDONBERN are sent to both LONDON and BERN lexicons. Non-assmilated stems.

4-syllable stems

Words in lexicon BASUDIS are trisyllabic in sg nom, and work like standard 4-syllable nouns. End on conconant and have cg. Even syllable case marking with acc -áv, ill -áj, ela -ás, etc. Real lule sami stems.

LEXICON BASUDIS-org Only singular. Placenames

LEXICON BASUDIS-mal Male names

LEXICON BASUDIS-plc Place names

Plurals

Words in lexicon VARGGAT even-syllable sámi plurals .

LEXICON VARGGAT-plc Plural stems, sáme names. Place names

LEXICON VARGGAT-org Plural stems, sáme names.

Words in lexicon ALEUHTAT even-syllables assimilated plurals.

LEXICON ALEUHTAT-plc Plural names, not sami names. like -váre, -gårtje

Odd syllable case marking

Words in lexicon LONDON end on conconant, no cg, case marking with -av, -ij, -is, etc. Last syllable is unstressed. Gets a regular odd syllable case marking. Both real lule sami stems, assimilated stems and non-assimilated stems

LEXICON LONDON-sur Odd-syllable. Surnames. Final foot structure (X.) and (X..) => Loc:%>is

LEXICON LONDON-ani Animals

LEXICON LONDON-org Only singular Organizations

LEXICON LONDON-mal Male names

LEXICON LONDON-malsur Names that can be both male- and surnames. Not used in smj-propernouns

LEXICON LONDON-fem Female names

LEXICON LONDON-malfem Names that can be both male and female names.Not used in smj-propernouns

LEXICON LONDON-malplc Names that can be both male- and placenames.Not used in smj-propernouns

LEXICON LONDON-plc Only singular. Placenames

LEXICON TJIERREK-plc Only singular. Placenames. Same as LONDON, but does not get Sem/Sur tag, not usuall for SMJ place names to become surnames.

LEXICON LONDON-orgsur Names that can be both organizations and surnames.Not used in Smj-propernouns

LEXICON LONDON-obj Objects.

LEXICON LONDON-LOAN-obj Objects loan. Not used in smj-propernouns

LEXICON LONDON-LOAN-plc Only nominatives. Placenames loan. Not used in Smj-propernouns

LEXICON LONDON-LOAN-org Only nominative. Organizations loan.Not used in smj-propernouns

JOKULL-plc are placenames. Lexicon added to make the code compile (?)

+N+Prop+Sem/Plc: LONDONDECL-PLC-SUR ; Placenames. NB added to make the code compile, needs revision. Gets an odd syllable case marking. Non-assimilated stems.

Words in lexicon ANAR end on conconant, no cg, case marking with ill -ij, ela -is. Gets an odd syllable case marking. Lule sami stems.

LEXICON ANAR-mal Male names.

LEXICON ANAR-plc Place names

Words in PIPPI lexicons are i-final, have no cg, no second syllable vowel change, and get odd syllable case marking with acc -hav, ill -hij, elat -his, etc. Works as “riebij”, but without the -j in nominative (it should maybe be Sirij and Pippij in nom?) and without cg. The last syllable is unstressed. Non-assimilated stems.

LEXICON PIPPI-ani IVowel-final names where case endings are added directly, no cg. Animals.

LEXICON PIPPI-obj Vowel-final names where case endings are added directly, no cg. Object names

LEXICON PIPPI-org Vowel-final names where caseendings are added directly, no cg. organizations

LEXICON PIPPI-mal Vowel-final names where case are added directly, no cg. Male names

LEXICON PIPPI-fem Vowel-final names where case endings are added directly, no cg. Female names

LEXICON PIPPI-femsur Vowel-final names where case endings are added directly, no cg. Female names also used as surnames

LEXICON PIPPI-malfem Vowel-final names where case endings are added directly, no cg. Names that can be both female and male names

LEXICON PIPPI-sur Vowel-final names where case endings are added directly, no cg. Surnames

LEXICON PIPPI-plc Vowel-final names where caseendings are added directly, no cg. Place names

LEXICON PIPPI-LOAN-plc Only nominatives. Vowel-final names where case endings are added directly, no cg.Place names

Words in lexicon DUORTNUS end on conconant, have cg and second syllable vowel change o:u, e:á. Odd syllable case marking. Real lule sami or one non-assimilated stem.

LEXICON DUORTNUS-mal Male names

LEXICON DUORTNUS-sur Male names

LEXICON DUORTNUS-org Odd-syllable ending on consonant, with cg. Organizations

LEXICON DUORTNUS-plc Odd-syllable ending on consonant, with cg.Placenames

LEXICON TIEMPEL-obj Same as DUORTNUS, only without second syll vowel change. Odd syllanle case marking Lexicon presently only for two -tiempel-final words. Lule sami stems.

LEXICON TIEMPEL-org Same as DUORTNUS, only without second syll vowel change. Odd syllanle case marking Lexicon presently only for two -tiempel-final words. Lule sami stems.

Lexicon HEANDARAT is not in use in smj

+Pl+Nom:aQ1 K ; +Pl+Gen:aQ1j K ; +Pl+Gen:aQ1j RHyph ; +Pl+Acc:aQ1jt K ; +Pl+Ill:aQ1jda K ; +Pl+Ine:aQ1jn K ; +Pl+Ela:aQ1js K ; +Pl+Com:aQ1j K ;

Words in lexicon EATNAMAT are odd-syllable plurals. Lule sami stems and non-assimilated stems.

LEXICON EATNAMAT-plc Place names. Presently only for Vuolleednama

LEXICON EATNAMAT-org Organizations

Contracted proper nouns

Words in lexicon DAVVISUOLLU are contracted propernouns ending on -åj/-oj. Lule sami stems

LEXICON DAVVISUOLU-plc Contracted stems ending on -oj. Place names.

Words in lexicon GEAVNNIS are contracted propernouns ending on -s.

LEXICON GEAVNNIS-plc Contracted stems ending on -es. Place names. Lule sami stems.

Words in lexicon SUOLLOT are contracted plurals. Lule sami stems.

LEXICON SULLOT-plc Plural names, only names ending on -suollu.

Lexicons only used in sme/sma and that are sent to other lexicons in smj

ERVASTI is only used in smi-propenouns. Ervasti names are 3-syllable and are needed as a seperate lexicon because of sma. ERVASTI is same as ACCRA in smj and gets even syllable case marking.

MAKI and NIEMI is only used in smi-propenouns. Maki names are even-syllable finnish names and are needed as a seperate lexicon because of sma. MÄKI is same as ACCRA in smj and gets even syllable case marking.

HANNOLA is the same as ACCRA


This (part of) documentation was generated from src/fst/morphology/affixes/propernouns.lexc