Kven Finnish NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-fkv

Phonological rules for Kven

This file documents the phonology.twolc file

We first define alphabets and sets. Thereafter come the rules.

Alphabets and sets

Alphabet

The letters

The archiphonemes

Letters with deviant behaviour

Triggers

Dialect tags

Literal quotes and angles

Thesemust be escaped (cf morpheme boundaries further down):

Sets

Development principles: — NO UNCLEAR SEQUENCES WITHOUT AN EXPLANATION (and TESTS): (Cns:) :Cns+ Cns: (:Cns) — One TRIGGER, one change! No ^AO that means a:o and a:0 and a:i, then 3 DIFFERENT triggers

Trigger order

(to be completed). The triggers should be in this order both in lexc and here in twolc.

stem Dial: WG {T0,TJ} {E0,I0,E2I,A2I,AO,AE,VDEL,UU} HMETA > {i2:,i3:,i4:](i5:} > suffixes

Rules

(Divided into consonant and vowel rules)

Consonant rules

Gemination rules

Pitkän vokaalin jälkeen ja kans painottoman tavun jälkeen k, t, p ja s geminoituvat ko perässä on pitkä vokaali (= lounaismurteitten erikoisgeminaatio), mutta muut konsonantit geminoituvat vaan lyhyen painollisen tavun jälkeen (= yleisgeminaatio). (ES).

Rule: Gemination 0:h

Rule: Gemination 0:j

Rule: Gemination 0:k

Tests:

Rule: Gemination 0:l

Rule: Gemination 0:m

Rule: Gemination 0:n

Rule: Gemination 0:p

Rule: Gemination 0:r

Rule: Gemination 0:s

Rule: Gemination 0:t

Rule: Gemination 0:v

Gradation rules

Rules for p gradation

Rule: Gradation p:0 (pp:p)

Rule: Gradation p:v

Rule: Gradation mp:mm

Tests:

Rules for k gradation

Rule: Gradation i6:0, in word poika: pojan

Tests:

TODO: When k:j and when k:0 between e and i.

Rule: Gradation k:j

Rule: Gradation k:0

Tests:

Tests:

Rule: Gradation k3:0

Tests:

Rule: Gradation k:v

Tests:

Rule: Gradation nk:ng

Tests:

Gradation t

Rule: Gradation Nt:NN in first syllable after short vowel

Rule: Gradation t:0 for tt:t, Nt:N and vuote:vuoeksi

Tests:

Rule: ti:si

Tests:

Rule: t:j in Var variant vuojeksi

Rule: o:u in vuosi vuote vuoet -> vuuet optional variant

Tests:

Rule: Gradation t:đ

Tests:

Assimilation rules

Rule: Alveolar assimilation for consonant stem l

Rule: Alveolar assimilation for consonant stem r

Rule: Alveolar assimilation for consonant stem s

Rule: j:0 in front of i

Vowel rules

Vowel harmony rules

Thefa idea of having f.ex V:e, always to a specific vowel, is to not have conflicts in twolc compilation. This improves compilation time (we assume) and it make Twolc behave more predictable. Weird things happen sometimes with conflicts! The downside is that often you have linguistic rules for vowels that are similar for a group of vowels! This is most of case. And now you have to edit it for all vowel separately one by one. Hopefully, we can make sure we don’t forget to do it too often, by having more user feedback, especially from the paradigms in the dictionary.

For each Vowel separately

Rule: ^V:e

Tests:

Rule: ^V:a

Tests:

Rule: ^V:ä

Tests:

Rule: ^V:i

Tests:

Rule: ^V:o

(the old system)

with variables (Vx/Vy) instead of each vowel separately

Rule: Back harmony for %^A: %^O: %^U:

Tests:

Vow copying and metathesis

Rule: Vow copying in short h-illative and short partitive sg

Tests:

Rule: Vow copying in partitive of words ending in io, ia

Rule: Vow copying in long h sg forms both part one and part two

Rule: Vow copying in long h pl forms

Rule: a to o and metathesis in h forms in pl of a-stems

Rule: Stem deletion in h-illative

Tests:

Stem alternation rules

e rules

Rule: e:i in nom.sg. of e-stems and in n_23ia kauhia hopia in Var

Rule: e:0 in consonant stems and illative plural

Tests:

Cns:0 in hoppe- hope-a in Var @RULENAME@ Jok

i rules

The -i- rules require different i-s for different POS.

Rule: i:0

a rules

Rule: a:0 before Pret and Pl i when rounded root vowel

Tests:

Rule: **a:o before Pl i and Pret i **

Rule: ä:ö before Pl i

Tests:

Rule: a:i in 3-syll stems with long a and i

Tests:

Rule: a:i in 3-syll stems with long a and i

Shortening

Rule: Shortening of long vowel in front of i

Other Vowel rules

(two A:e rules and one ä:0)

Rule: a:e in comparative

Tests:

Rule: a:e in passives

Rule: ä:0

Tests:

Rule: ö:0

Rule: o:0

Gemination tests

Tests:

Rule: o:0

Rule: o:0

Rule: o:0


This (part of) documentation was generated from src/fst/morphology/phonology.twolc