North Sami NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-sme

Numerals

Background

Some Numeral meeting is long overdue

Potential obstacle

Different people have different things they want to discuss. Now, we just jot down what we want, and then go to lunch.

Question to think of:

What part of the list below as a plenary discussion for all, and what for only some.

Topics

  1. The TODO-list (all)
  2. Morphology (Trond, Thomas, Sjur?, Börre?)
  3. Spelling: Recursivity and the non-recursive transducers (Thomas,)
  4. Preprocessing (Trond, Saara, no need to meeting on this, there is a bug report)
  5. Morphosyntax (Trond, Linda, others?)

The TODO-list:

Fleshing out some priority items:

Top-priority: make num-paradgim (Trond) Done.

cvs up
cd gt/sme/testing
make num-paradigm WORD=guokte

Thereafter: someone to test the output (let’s see how it looks before we decide who)

Speller: numbers in non-recursive

Morphology

Split the lexicon into circular and non-circular parts:

golbma#cuohti#vihtta#duhát = four parts

ok - 300 000 ?

golbma#cuohti#duhát = three parts


"<golbmačuohtiduhát>"
        "golbmačuohtiduhát" Num Sg Nom <<< @SPRED

thus case inflection on all parts? yes, and that is not implemented. I suggest we do that after the beta release.

That is fine with me, we don’t need perfect numeral coverage in the beta

Have a look at the Finnish numeral example from the book, see [this link. | http://www.stanford.edu/~laurik/fsmbook/LSA-207/Scripts/finnum/Finnish%20Numerals.html]

Complex numerals written as single words

Nothing here yet

Spelling

Cf. the num.plx.txt list (output from the present PLX number conversion). We need a clarification of the num.plx.txt list. What is in, what is out, and why. Börre?

Preprocessing

Issue: We analyse numerals at the end of sentences as ordinals. Oslo analyses it as cardinals. Arguments for our view? I (TT) am leaning toward the Oslo solution as better.

I could do some research with both options. That’s fine. I remember there was a request to analyze end of sentences as ordinals, but it can be otherwise.

Examples of sentence final numerical expressions:

    1. Dat lea s. 240.
    1. Dieđáhusa nummar okta.
    1. Mun lean ilus go beasan ovdanbuktit St.dieđ. nr. 33.
    1. Mun boađán diibmu vihtta/viđas.
    1. Máŋggabealatvuohta ja ovttadássásašvuohta - ráđđehusa dearvvašvuođa- ja
   sosiálabálvalusaid doaibmaplána Norgga sápmelaččaid várás 2002-2005.
        1. There is no problem with years, they are not regarded as ordinals in the
    preprocessor in any case. How do you know it is a year?  we have rules that
    say it is a year after "jagi" and we have a Range analysis for "-"
    expressions, but isn't it a usual numeral (potentially) otherwise?
    1. He owned kr 10000.
    1. She was only 16.
    1. The population is 120.000.
    1. He reads from book 4.
        1. Well, I don't go so deep to details, I just assume that a number containing
    4 digits is a year. Of course there are errors, but not so many.
        1. I'm not so sure since we have a lot of law texts to which get numbers and
    other entitites which just as well could get 4 digits, it would be nice
    though to have some help for recognizing years
        1. Trond and I have worked on date expressions such as 21.5.2000 which are now
    clearly identified as dates, that helps a lot
        1. Yes, they are fine. The thing is, at the moment I don't have any better
    heuristics for recognizing years, if they are not part of some date
    expression like above. It may also be that the date expressions in analyzer
    are not compatible with preprocessor, since I haven't updated it lately.
        1. But we are maybe talking of two issues here. The expressions with 4 digits
    are not considered as ordinals (since they are assumed to be years) but the
    question is if there are other 4-digit expressions that are ordinals
    contrary to the assumption.
        1. would it be possible to get a tag for the 4-digit expressions so that we
    could use it in the rule file?
        1. counterexamples could be:
    1. I took part in a race. I got 1345.
        1. Probably not very often in our corpus :)
        1. Exactly, I think this is an empirical question.


Empirical investigation:
kwic-snt '[a-z] [0-9]\. [A-Z]
So far, ONE example with sentence-final ordinals (Nordlys). 
tafettinnspurt greide Trude å sikre Kjelsås sitt 10. NM-gull. 
er til Kypros torsdag 2. februar, proffene søndag 5. Roger Nilsen   <== 1/100
  real example, ellipsis                              

probably not..unless min aigi desides to write about large-scale sports with placing

Ok. What if you just punched some Finnish text through the preprocessor and looked for the final numerals, to get an impression? I am pretty sure Fi and Sa behave the same.

So, we have a verdict, and a new system.

Syntax

Date

### Range ### Numbers after ":" such as in "faksa: 2323232323" I tried to solve that problem before when there were still sentence borders, so that the numeral item after the communication-device would get @SPRED (which was at least my favorable analysis) but after the sentence borders disappeared the analysis does not work any more. My proposal would be to make a set of communication devices with numbers and make a new rule ``` "" "Statens" N Prop Attr @PROP> "" "forvaltningstjeneste" ? @X "" "Informasjonsforvaltning" ? @X "" "poasta#boksa" N Sg Nom @SUBJ "<8169>" "8169" Num Sg Acc @OBJ "8169" Num Sg Gen @QN> "" "poasta#boksa" N Sg Nom @HNOUN "<8169>" "8169" Num Sg Nom @NumN< "<.>" "." CLB <<< Status? ``` ### functional numbers in texts (such as ennumerations etc.) This is a formating problem. I would prefer to mark those numbers as text-functional numbers, otherwise we always have the problem that they could be either quantifiers (post or pre) or one of the "thousand" different functions of numerals... ``` "<¶>" "¶" CLB <<< "" "SISDOALLU" ? @X "" "ovda#sátni" N Sg Nom @HNOUN "<...>" "..." CLB <<< "<5>" "5" Num Sg Nom @HNOUN "<¶>" "¶" CLB <<< "<1.>" "1" A Ord @X "<.>" "." CLB <<< "<12>" "12" Num Sg Gen @QN> "<2.1>" "2.1" Num Sg Gen @QN> "<2.2>" "2.2" Num Sg Gen @QN> "<2.3>" "2.3" Num Sg Gen @QN> "<2.4>" "2.4" Num Sg Gen @QN> "" "sáme#politihkalaš" A Attr @AN> "" "perspektiiva" N Pl Nom @HNOUN "<...>" "..." CLB <<< "<12>" "12" Num Sg Gen @GN> "" "riikka#viiddus" N* Der1 Der/Dimin N Sg Nom @HNOUN "" "diehto#juohkin#doaibma" N Pl Nom @SPRED ``` ### Numbers in date expressions there is a lot of creativity among dates (written in several or just one words, leaving out month or day, writing b or beaivvi probably not a big problem, even though sometimes I am unsure about the case ``` "" "gustojeaddji" A Attr @AN> "" "luodda#plána" N Sg Nom @SUBJ "" "dohkkehit" V TV Der2 Der/Pass Ind Prt Sg3 @+FMAINV "" "guovva#mánnu" N Sg Gen @ADVL "<1997>" "1997" Num Sg Nom @SUBJ "1997" Num Sg Nom @APP "1997" Num Sg Nom @HNOUN "" "ja" CC @CC-VP "" "sisttis#doallat" V TV Ind Prs Sg3 @+FMAINV "" "njuolggadus" N Pl Acc @OBJ "" "ahte" CS @CS-VP "" "makkár" Pron Dem Attr @DN> "" "eaktu" N Pl Nom @SUBJ "" "galgat" V IV Ind Prs Pl3 @+FAUXV "" "leat" V IV Inf @-FMAINV "" "deavdit" V TV Der2 Der/Pass PrfPrc @-FMAINV "" "go" CS @CS-VP "" "klassifiseret" V TV Ind Prs Sg3 @+FMAINV "" "suohkanlaš" A Attr @AN> "" "luodda" N Sg Acc @OBJ "<.>" "." CLB <<< ``` ### Numbers postmodifying nouns * so etwas wie láhka nr. 67, jahki 2000,... * The láhka nr. 45 already has the analysis we want (the @NNum< must be interpreted as "not the one to the left, but the one to the left of that again. I was thinking about more complex expressions, I can look after concrete examples Gut. ``` "" "láhka" N Sg Nom @SPRED "" "nr" ABBR N Nom @NNum> "nr" ABBR N Gen @NNum> "<45>" "45" Num Sg Nom <<< @NNum< ``` That is a wrong analysis... ### Concrete analyses ``` golbma maŋemus jagi leat lassánan buhtadusat boraspiriid geažil. "" S:14943, 14945, 15284 "golbma" Num Sg Nom S:3314 @SUBJ <==== ADVL "" S:8267 "maŋemus" A Attr S:2431 @AN> "" S:9657 "jahki" N Sg Gen S:2900 @NQ< "" S:8607, 12762, 13029, 13029, 13276, 13276, 14531 "leat" V IV Ind Prs Pl3 S:3102 @+FAUXV "" S:3610, 5049, 5049, 8834 "lassánit" V IV PrfPrc S:3162 @-FMAINV "" S:10195, 10358, 10358, 10586 "buhtadus" N Pl Nom S:3314 @SPRED "" S:9683 "bora#spire" N Pl Gen S:2798 @GP> "" S:5921 "geažil" Po S:3419 @ADVL "<.>" "." CLB <<< [ "<62>" "62" <=== head "" Num "milj" ABBR Num Gen @NumQ< <=== "milj" ABBR Num Acc @NumQ< "<10>" "10" Num Sg Nom @SUBJ-QH "" "kr" ABBR Gen <<< @NQ< <10>" S:10069, 10069, 14874, 15260 "10" Num Sg Gen S:2683 @QN> "" S:10069 "miljon" Num Sg Gen S:2647 @NumQ< "" S:3583, 15042 "ruvdnu" N Sg Acc <<< S:3397 @OBJ ok: "<10>" S:10072, 10072, 15392, 15468 "10" Num Sg Nom S:3360 @SUBJ "" S:9871 "miljon" Num Sg Gen S:2647 @NumQ< "" S:9851 "ruvdnu" N Sg Gen S:2650 @NQ< "" S:15390 "leat" V IV Ind Prs Sg3 S:3145 @+FMAINV "" "stuoris" A Attr S:2449 @AN> "" S:10090, 10090, 10804, 14722 "summa" N Sg Nom S:3360 @SPRED "<.>" "." CLB <<< ``` what modifies what? > 2006 stáhtabušeahtas lea ráđđehus liigudan 10 milj. kr álggahanmearreruhtan ođđa dieđavistái. ``` "<2006>" "2006" Num Sg Acc @OBJ "" "stáhta#bušeahtta" N Sg Gen PxSg3 @NQ< "" "leat" V IV Ind Prs Sg3 @+FAUXV "" "ráđđehus" N Sg Nom @SUBJ "" "liigudit" V TV PrfPrc @-FMAINV "<10>" || "10" Num Sg Gen @QN> || "" || "milj" ABBR Num Acc @NumQ< || "milj" ABBR Num Gen @NumQ< || "" || "kr" ABBR Gen @GN> <== @NQ< || "<álggahanmearreruhtan>" "álggahan#mearre#ruhta" N Ess @OPRED "" "ođas" A Attr @AN> "" "dieđa#visti" N Sg Ill @ADVL "<.>" "." CLB <<< "<2006>" S:11106, 11106, 15045, 15313 "2006" Num Sg Acc S:3400 @OBJ "" S:9851 "stáhta#bušeahtta" N Sg Gen PxSg3 S:2936 @NQ< "" S:14633 "leat" V IV Ind Prs Sg3 S:3145 @+FAUXV "" S:8340, 14714 "ráđđehus" N Sg Nom S:3360 @SUBJ "" S:3665, 8951 "liigudit" V TV PrfPrc S:3207 @-FMAINV "<10>" S:11064, 15057, 15395 "10" Num Sg Gen S:2686 @QN> "" S:11064 "milj" ABBR Num Gen S:2647 @NumQ< "milj" ABBR Num Acc S:2647 @NumQ< "" S:3586, 9851 "kr" ABBR Gen S:2650 @NQ< "<álggahanmearreruhtan>" S:5138, 5138, 5138, 5138, 5138, 15392 "álggahan#mearre#ruhta" N Ess S:3495 @OPRED "" "ođas" A Attr S:2449 @AN> "" "dieđa#visti" N Sg Ill S:3480 @ADVL "<.>" "." CLB <<< "" "Sámediggi" N Prop Org Sg Nom @SUBJ "" "hálddašit" V TV Ind Prs Sg3 @+FMAINV "<2>" "2" Num Sg Acc @OBJ "2" Num Sg Gen @QN> "" "milj" ABBR Num Acc @NumQ< "milj" ABBR Num Gen @NumQ< "" "kr" ABBR Acc @OBJ "kr" ABBR Gen @GN> "" "doarjja#ortnet" N Sg Acc @OBJ "" "mii" Pron Rel Sg Acc @OBJ "" "Birasgáhttendepartemeanta" N Prop Org Sg Nom @SUBJ "" "leat" V IV Ind Prs Sg3 @+FAUXV "" "ruhtadit" V TV PrfPrc @-FMAINV "<.>" "." CLB <<< "<2006>" "2006" Num Sg Nom S:3360 @SUBJ "2006" Num Sg Acc S:3400 @OBJ "2006" Num Sg Gen S:2686 @QN> "2006" Num Sg Acc S:3400 @OPRED "2006" Num Sg Nom S:3360 @SPRED "" "stáhta#bušeahtta" N Sg Acc PxSg3 S:3400 @OBJ "stáhta#bušeahtta" N Sg Gen PxSg3 S:2936 @NQ< "stáhta#bušeahtta" N Sg Loc S:3482 @ADVL "stáhta#bušeahtta" N Sg Acc PxSg3 S:3400 @OPRED "" "leat" V IV Ind Prs Sg3 S:3145 @+FAUXV "leat" V IV Ind Prs Sg3 S:3145 @+FMAINV "" "ráđđehus" N Sg Nom S:3360 @SUBJ "ráđđehus" N Sg Nom S:3360 @SPRED "ráđđet" V TV Der1 Der/h Imprt Prs Sg3 S:3195 @+FMAINV "" S:3665 "liigudit" V TV Actio Nom S:3360 @SUBJ "liigudit" V TV Actio Acc S:3400 @OBJ "liigudit" V TV Ind Prs Sg1 S:3195 @+FMAINV "liigudit" V TV Actio Acc S:3400 @OPRED "liigudit" V TV PrfPrc S:3207 @-FMAINV "liigudit" V TV Actio Gen S:3498 @ADVL "liigudit" V TV Actio Nom S:3360 @SPRED "<10>" "10" Num Sg Acc S:3397 @OBJ <============= gibt's schon!! "10" Num Sg Nom S:2754 @NumN< "10" Num Sg Acc S:3397 @OPRED "10" Num Sg Gen S:2686 @QN> <============== ausschliessen? "" "milj" ABBR Num Acc S:3397 @OBJ "milj" ABBR Num Nom S:2724 @SUBJ-QH "milj" ABBR Num Nom S:2724 @SPRED "milj" ABBR Num Gen S:2647 @NumQ< "milj" ABBR Num Acc S:3397 @OPRED "" S:3586 "kr" ABBR Acc S:3400 @OBJ "kr" ABBR Nom S:3360 @SUBJ "kr" ABBR Gen S:2650 @NQ< "kr" ABBR Nom S:3360 @SPRED "kr" ABBR Acc S:3400 @OPRED "<álggahanmearreruhtan>" "álggahan#mearre#ruhta" N Ess S:3495 @SPRED "álggahan#mearre#ruhta" N Sg Nom PxSg1 S:3360 @SUBJ "álggahan#mearre#ruhta" N Sg Acc PxSg1 S:3400 @OBJ "álggahan#mearre#ruhta" N Sg Gen PxSg1 S:2886 @GN> "álggahan#mearre#ruhta" N Sg Acc PxSg1 S:3400 @OPRED "álggahan#mearre#ruhta" N Sg Nom PxSg1 S:3360 @SPRED "álggahan#mearre#ruhta" N Ess S:3495 @OPRED "" "ođas" A Attr S:2449 @AN> "" "dieđa#visti" N Sg Ill S:3480 @ADVL "<.>" "." CLB <<< ```

Sitemap

Debugging site.pages:

URL: /assets/css/style.css - Title:

URL: /ConvertingToApertium.html - Title:

URL: /KompilereFST.html - Title: Oversyn

URL: /Links.html - Title:

URL: /Numerals.html - Title: Numerals

URL: /PXdiscussion.html - Title: Begrense generering av NPx?

URL: /SpellerConfiguration.html - Title:

URL: /TheL2Transducer.html - Title:

URL: /docu-mini-smi-grammartags.html - Title: The grammatical tags

URL: /docu-sme-about-disamb.html - Title: Intro

URL: /docu-sme-bugs.html - Title: Bug reports, errors

URL: /docu-sme-dis.html - Title: The disambiguation file itself

URL: /docu-sme-flag-diacritics.html - Title: Flag diacritics

URL: /docu-sme-flowchart.html - Title: A flowchart over the sme files for morphological parsing

URL: /docu-sme-grammartags.html - Title:

URL: /docu-sme-lex.html - Title: Introduction

URL: /docu-sme-makefile.html - Title: The sme Makefile and scripts

URL: /docu-sme-preprocessor.html - Title: Introduction

URL: /docu-sme-testplan.html - Title: Test plan for sme

URL: /docu-sme-twol.html - Title: Documenting the North Saami twol rules file

URL: /gramcheck/evaluation/2021-06-24.html - Title: GramDivvun evaluation 24.6.2021

URL: /gramcheck/ - Title: The North Sámi Grammar checker project

URL: /gramcheck/meetings/2020-12-04.html - Title: Møte 26.11., 4.12., 10.12.2020

URL: /gramcheck/meetings/2021-05-19.html - Title: GRAMDIVVUN-møte 19.05.2021 10:30-12:00

URL: /gramcheck/sme-grc-beta2019.html - Title: Documentation North Sámi grammar checker beta

URL: /index-header.html - Title: North Sami documentation

URL: / - Title: North Sami documentation

URL: /j-sme.html - Title:

URL: /newcg/Grammar_Homonymy.html - Title: Data from 2013-01-24

URL: /newcg/Verb_Homonymy.html - Title: Data from 2013-01-24

URL: /newcg/Words_Homonymy.html - Title: Data from 2013-01-24

URL: /newcg/cg50top.html - Title:

URL: /newcg/sets_not_in_use.html - Title: sme-dis.rle-seahtat mat eai leat geavahusas:

URL: /normativity-issues.html - Title: Background

URL: /old-documentation.html - Title: Obsolete North Sámi documetation

URL: /possessives.html - Title: Possessive suffikser i nordsamisk

URL: /preamble.html - Title: Free and Open source Northern Sami analyser giella-sme

URL: /sets.html - Title: Noun sets

URL: /sme-fst-guide.html - Title: North Sámi (sme) fst guide for beginners

URL: /sme-syn-open.html - Title: Disambiguation of grammatical properties

URL: /sme-testdiary.html - Title: Test results for the morphology and lexicon files

URL: /sme.html - Title: North Sami language model documentation

URL: /src-cg3-disambiguator.cg3.html - Title: DELIMITERS

URL: /src-cg3-semanticroles.cg3.html - Title:

URL: /src-cg3-speech_disambiguator.cg3.html - Title: DELIMITERS

URL: /src-cg3-valency.cg3.html - Title:

URL: /src-fst-morphology-affixes-abbreviations.lexc.html - Title: Continuation lexicons for abbreviations

URL: /src-fst-morphology-affixes-acronyms.lexc.html - Title: North Saami acronyms - affix part

URL: /src-fst-morphology-affixes-adjectives.lexc.html - Title:

URL: /src-fst-morphology-affixes-nouns.lexc.html - Title:

URL: /src-fst-morphology-affixes-numerals.lexc.html - Title: North Saami numerals

URL: /src-fst-morphology-affixes-possessive-suffixes.lexc.html - Title:

URL: /src-fst-morphology-affixes-pronouns.lexc.html - Title:

URL: /src-fst-morphology-affixes-propernouns.lexc.html - Title:

URL: /src-fst-morphology-affixes-symbols.lexc.html - Title: Symbol affixes

URL: /src-fst-morphology-affixes-verbs.lexc.html - Title:

URL: /src-fst-morphology-clitics.lexc.html - Title:

URL: /src-fst-morphology-compounding.lexc.html - Title:

URL: /src-fst-morphology-phonology.bergslan.twolc.html - Title:

URL: /src-fst-morphology-phonology.twolc.html - Title:

URL: /src-fst-morphology-root.lexc.html - Title:

URL: /src-fst-morphology-stems-adjectives.lexc.html - Title: North Sámi adjective lexicon

URL: /src-fst-morphology-stems-adpositions.lexc.html - Title:

URL: /src-fst-morphology-stems-adverbs.lexc.html - Title: North Saami adverbs

URL: /src-fst-morphology-stems-conjunctions.lexc.html - Title: North Saami Conjunctions

URL: /src-fst-morphology-stems-interjections.lexc.html - Title: North Saami Interjections

URL: /src-fst-morphology-stems-nouns.lexc.html - Title: North Sámi noun lexicon !

URL: /src-fst-morphology-stems-numerals.lexc.html - Title: North Saami numerals

URL: /src-fst-morphology-stems-particles.lexc.html - Title: This file contains the Particles

URL: /src-fst-morphology-stems-pronouns.lexc.html - Title: This file contains the Pronouns

URL: /src-fst-morphology-stems-sme-abbreviations.lexc.html - Title: File containing North Saami abbreviations

URL: /src-fst-morphology-stems-sme-propernouns.lexc.html - Title: The North Saami proper noun lexicon

URL: /src-fst-morphology-stems-sme-punctuation.lexc.html - Title: Punctuation symbols

URL: /src-fst-morphology-stems-subjunctions.lexc.html - Title: The North Saami Subjunctions

URL: /src-fst-morphology-stems-verbs.lexc.html - Title: North Saami verbs

URL: /src-fst-phonetics-text2tts-fin.xfscript.html - Title:

URL: /src-fst-phonetics-text2tts-nob.xfscript.html - Title:

URL: /src-fst-phonetics-text2tts-sme.xfscript.html - Title:

URL: /src-fst-phonetics-txt2ipa.xfscript.html - Title:

URL: /src-fst-phonology-L2-from-branch.twolc.html - Title:

URL: /src-fst-phonology-L2.twolc.html - Title:

URL: /src-fst-transcriptions-transcriptor-symbols2text.lexc.html - Title:

URL: /tools-grammarcheckers-grammarchecker-resource.cg3.html - Title:

URL: /tools-grammarcheckers-grammarchecker.cg3.html - Title:

URL: /tools-grammarcheckers-grc-disambiguator.cg3.html - Title:

URL: /tools-grammarcheckers-spellchecker.cg3.html - Title:

URL: /tools-tokenisers-tokeniser-disamb-gt-desc.pmscript.html - Title: Tokeniser for sme

URL: /tools-tokenisers-tokeniser-gramcheck-gt-desc.pmscript.html - Title: Grammar checker tokenisation for sme

URL: /tools-tokenisers-tokeniser-tts-cggt-desc.pmscript.html - Title: TTS tokenisation for smj

URL: /xerox-discussion.html - Title: Introduction

Root items:

URL: /ConvertingToApertium.html - Title: Convertingtoapertium

URL: /KompilereFST.html - Title: Oversyn

URL: /Links.html - Title: Links

URL: /Numerals.html - Title: Numerals

URL: /PXdiscussion.html - Title: Begrense generering av NPx?

URL: /SpellerConfiguration.html - Title: Spellerconfiguration

URL: /TheL2Transducer.html - Title: Thel2transducer

URL: /docu-mini-smi-grammartags.html - Title: The grammatical tags

URL: /docu-sme-about-disamb.html - Title: Intro

URL: /docu-sme-bugs.html - Title: Bug reports, errors

URL: /docu-sme-dis.html - Title: The disambiguation file itself

URL: /docu-sme-flag-diacritics.html - Title: Flag diacritics

URL: /docu-sme-flowchart.html - Title: A flowchart over the sme files for morphological parsing

URL: /docu-sme-grammartags.html - Title: Docu-sme-grammartags

URL: /docu-sme-lex.html - Title: Introduction

URL: /docu-sme-makefile.html - Title: The sme Makefile and scripts

URL: /docu-sme-preprocessor.html - Title: Introduction

URL: /docu-sme-testplan.html - Title: Test plan for sme

URL: /docu-sme-twol.html - Title: Documenting the North Saami twol rules file

URL: /gramcheck/ - Title: The North Sámi Grammar checker project

URL: /index-header.html - Title: North Sami documentation

URL: / - Title: North Sami documentation

URL: /j-sme.html - Title: J-sme

URL: /normativity-issues.html - Title: Background

URL: /old-documentation.html - Title: Obsolete North Sámi documetation

URL: /possessives.html - Title: Possessive suffikser i nordsamisk

URL: /preamble.html - Title: Free and Open source Northern Sami analyser giella-sme

URL: /sets.html - Title: Noun sets

URL: /sme-fst-guide.html - Title: North Sámi (sme) fst guide for beginners

URL: /sme-syn-open.html - Title: Disambiguation of grammatical properties

URL: /sme-testdiary.html - Title: Test results for the morphology and lexicon files

URL: /sme.html - Title: North Sami language model documentation

URL: /src-cg3-disambiguator.cg3.html - Title: DELIMITERS

URL: /src-cg3-semanticroles.cg3.html - Title: Src-cg3-semanticroles.cg3

URL: /src-cg3-speech_disambiguator.cg3.html - Title: DELIMITERS

URL: /src-cg3-valency.cg3.html - Title: Src-cg3-valency.cg3

URL: /src-fst-morphology-affixes-abbreviations.lexc.html - Title: Continuation lexicons for abbreviations

URL: /src-fst-morphology-affixes-acronyms.lexc.html - Title: North Saami acronyms - affix part

URL: /src-fst-morphology-affixes-adjectives.lexc.html - Title: Src-fst-morphology-affixes-adjectives.lexc

URL: /src-fst-morphology-affixes-nouns.lexc.html - Title: Src-fst-morphology-affixes-nouns.lexc

URL: /src-fst-morphology-affixes-numerals.lexc.html - Title: North Saami numerals

URL: /src-fst-morphology-affixes-possessive-suffixes.lexc.html - Title: Src-fst-morphology-affixes-possessive-suffixes.lexc

URL: /src-fst-morphology-affixes-pronouns.lexc.html - Title: Src-fst-morphology-affixes-pronouns.lexc

URL: /src-fst-morphology-affixes-propernouns.lexc.html - Title: Src-fst-morphology-affixes-propernouns.lexc

URL: /src-fst-morphology-affixes-symbols.lexc.html - Title: Symbol affixes

URL: /src-fst-morphology-affixes-verbs.lexc.html - Title: Src-fst-morphology-affixes-verbs.lexc

URL: /src-fst-morphology-clitics.lexc.html - Title: Src-fst-morphology-clitics.lexc

URL: /src-fst-morphology-compounding.lexc.html - Title: Src-fst-morphology-compounding.lexc

URL: /src-fst-morphology-phonology.bergslan.twolc.html - Title: Src-fst-morphology-phonology.bergslan.twolc

URL: /src-fst-morphology-phonology.twolc.html - Title: Src-fst-morphology-phonology.twolc

URL: /src-fst-morphology-root.lexc.html - Title: Src-fst-morphology-root.lexc

URL: /src-fst-morphology-stems-adjectives.lexc.html - Title: North Sámi adjective lexicon

URL: /src-fst-morphology-stems-adpositions.lexc.html - Title: Src-fst-morphology-stems-adpositions.lexc

URL: /src-fst-morphology-stems-adverbs.lexc.html - Title: North Saami adverbs

URL: /src-fst-morphology-stems-conjunctions.lexc.html - Title: North Saami Conjunctions

URL: /src-fst-morphology-stems-interjections.lexc.html - Title: North Saami Interjections

URL: /src-fst-morphology-stems-nouns.lexc.html - Title: North Sámi noun lexicon !

URL: /src-fst-morphology-stems-numerals.lexc.html - Title: North Saami numerals

URL: /src-fst-morphology-stems-particles.lexc.html - Title: This file contains the Particles

URL: /src-fst-morphology-stems-pronouns.lexc.html - Title: This file contains the Pronouns

URL: /src-fst-morphology-stems-sme-abbreviations.lexc.html - Title: File containing North Saami abbreviations

URL: /src-fst-morphology-stems-sme-propernouns.lexc.html - Title: The North Saami proper noun lexicon

URL: /src-fst-morphology-stems-sme-punctuation.lexc.html - Title: Punctuation symbols

URL: /src-fst-morphology-stems-subjunctions.lexc.html - Title: The North Saami Subjunctions

URL: /src-fst-morphology-stems-verbs.lexc.html - Title: North Saami verbs

URL: /src-fst-phonetics-text2tts-fin.xfscript.html - Title: Src-fst-phonetics-text2tts-fin.xfscript

URL: /src-fst-phonetics-text2tts-nob.xfscript.html - Title: Src-fst-phonetics-text2tts-nob.xfscript

URL: /src-fst-phonetics-text2tts-sme.xfscript.html - Title: Src-fst-phonetics-text2tts-sme.xfscript

URL: /src-fst-phonetics-txt2ipa.xfscript.html - Title: Src-fst-phonetics-txt2ipa.xfscript

URL: /src-fst-phonology-L2-from-branch.twolc.html - Title: Src-fst-phonology-l2-from-branch.twolc

URL: /src-fst-phonology-L2.twolc.html - Title: Src-fst-phonology-l2.twolc

URL: /src-fst-transcriptions-transcriptor-symbols2text.lexc.html - Title: Src-fst-transcriptions-transcriptor-symbols2text.lexc

URL: /tools-grammarcheckers-grammarchecker-resource.cg3.html - Title: Tools-grammarcheckers-grammarchecker-resource.cg3

URL: /tools-grammarcheckers-grammarchecker.cg3.html - Title: Tools-grammarcheckers-grammarchecker.cg3

URL: /tools-grammarcheckers-grc-disambiguator.cg3.html - Title: Tools-grammarcheckers-grc-disambiguator.cg3

URL: /tools-grammarcheckers-spellchecker.cg3.html - Title: Tools-grammarcheckers-spellchecker.cg3

URL: /tools-tokenisers-tokeniser-disamb-gt-desc.pmscript.html - Title: Tokeniser for sme

URL: /tools-tokenisers-tokeniser-gramcheck-gt-desc.pmscript.html - Title: Grammar checker tokenisation for sme

URL: /tools-tokenisers-tokeniser-tts-cggt-desc.pmscript.html - Title: TTS tokenisation for smj

URL: /xerox-discussion.html - Title: Introduction

Directory items:

URL: /gramcheck/evaluation/2021-06-24.html - Title: GramDivvun evaluation 24.6.2021

URL: /gramcheck/meetings/2020-12-04.html - Title: Møte 26.11., 4.12., 10.12.2020

URL: /gramcheck/meetings/2021-05-19.html - Title: GRAMDIVVUN-møte 19.05.2021 10:30-12:00

URL: /gramcheck/sme-grc-beta2019.html - Title: Documentation North Sámi grammar checker beta

URL: /newcg/Grammar_Homonymy.html - Title: Data from 2013-01-24

URL: /newcg/Verb_Homonymy.html - Title: Data from 2013-01-24

URL: /newcg/Words_Homonymy.html - Title: Data from 2013-01-24

URL: /newcg/cg50top.html - Title: Cg50top

URL: /newcg/sets_not_in_use.html - Title: sme-dis.rle-seahtat mat eai leat geavahusas: