Woods Cree morphological analyser
INTRODUCTION TO MORPHOLOGICAL ANALYSER OF Plains Cree LANGUAGE.
Definitions for Multichar_Symbols
Analysis symbols
The morphological analyses of wordforms of Plains Cree are presented
in this system in terms of the following symbols.
(It is highly suggested to follow existing standards when adding new tags).
POS
- +N = Noun
- +V = Verb
- +Ipc = Indeclinable Particle
- +Prop
- +Adv
- +CC
- +CS
- +Interj
- +Phr
- +Pron
- +Num
- +Arab
- +Rom
- +PUNCT = punctuation symbols
- +LEFT = the left part of a paired punctuation symbol
- +RIGHT = the right part of a paired punctuation symbol
- +CLB = clause boundary symbols
- +Symbol = independent symbols in the text stream, like £, €, ©
- +ABBR
Nominal morphology
- +Loc Locative
- +Obv Obviative
-
+Voc Vocative
- +Dim Diminutive
Particles
- +Def This is the intransitive demonstrative, i.e. the definite.
-
+Indef Indefinite
- +Dem Demonstrative
- +Prox Demonstrative Proximate
- +Med Demonstrative Medial
- +Dist Demonstrative Distal
- +Pers = personal pronouns? At least it seems so based on the code
- +Interr Interrogative (who/whose/what/what kind)
- +Foc Focus particle
ordinals
Verbal MSP
Person prefix fragment features
Nominal morphosyntactic features
Verb conjugation (transitivity + animacy classes)
- +AI intransitive with animate subject,
- +II intransitive with inanimate subject,
- +TA transitive with animate object, and
- +TI transitive with inanimate object.
Noun animacy and dependency classes
- +A animate noun
- +I inanimate noun
-
+D dependent noun
- +Qst yes-no question particle; cî
- +Neg negation; [na]môy[a].
Preverbs
Auxiliary symbols
These symbols either shape or govern the
morphophonological structure
- %> suffix border
- %< prefix border
Symbols that need to be escaped on the lower side (towards twolc):
- »7: Literal »
- «7: Literal «
%[%>%] - Literal >
%[%<%] - Literal <
Special characters for morphophonology
- w2 mowêw:mow2
- t2 Epenthetic -t- between person prefixes and vowel-initial stems
- t3 t to s in VTA-4
- t4 t:c in VTI-1 with unspecified actor
- y2 epenthetic joiner in reduplication of vowel-initial stems
- y3 epenthetic joiner in reduplication of vowel-initial stems
-
i2 vta-5i epenthesis.
- h2 Prefix in possessives
Triggers for various morphophonological phenomena
Mostly, these are not realized themselves as any grapheme/phoneme
- %^EGLOT glottal stop after e, for eh- in conjunctive order
These tags distinguish different special-purpose analysers
and generators from each other. Thus, for examples, we have
normative and descriptive analysers, and generators for different purposes.
- +Err/Orth tag for substandard forms
- +Err/Frag tag for word-form fragments
- +Err/Morph tag for nonstandard morphology
- +Err/Thm tag for nonstandard possessive theme (0 vs. -im) morphology
- +Err/Dim tag for nonstandard diminutive morphology (-is vs. -isis)
- +Err/Dummy tag for dummy lexemes used for testing purposes
- +Dial tag for dialectical forms that can’t be called errors
- +Dial/East tag for dialectical forms that can’t be called errors
- +Dial/West tag for dialectical forms that can’t be called errors
- +Var tag for dialectical forms that can’t be called errors
- +Var/East tag for dialectical forms that can’t be called errors
- +Var/West tag for dialectical forms that can’t be called errors
- +Use/NG not-generate, for ped generation isme-ped.fst
- +Eng indicates that this is an English form
Flagdiacritics
These are documented in Chapter 8 of Beesley/Karttunen, p. 456 zB.
For indicative, there are prefixes, so here we need one
flag for each person-number combination. Note that
for the inverse objective conjugation, the flag refers to
the prefix, not to the subject. So indsg1 refers to either
subject = 1Sg or object = 1Sg. The 3-3 forms are prefixless.
The conjunct form always has
the ê- prefix, and future conditional never has a prefix.
- @U.verb.FutCon@ Future Conditional
Prefixes with a certain phonological content:
- @U.person.NULL@
- @U.person.NI@
- @U.person.KI@
Order
- @U.order.indep@ Independent
- @U.order.cnj@ Conjunct
- @U.order.imp@ Imperative
Tense
New multichar symbols for nouns
End of new and all Multichar_Symbols
LEXICON Root is where it all starts
- NOUN_PREFIXES ;
- NOUN_IRREGULARS ;
- Vocative_Nouns ;
- VerbPrefixes ;
- Pronoun ;
- Propernouns ;
- Particles ;
- Numerals ;
- Abbreviation ;
- Punctuation ;
- Symbols ;
- NON_STANDARD ;
This (part of) documentation was generated from src/fst/morphology/root.lexc