Mansi NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-mns

The Mansi morphophonological/twolc rule file

This file documents the phonology.twolc file

Скрибник Е.К. & К.В. Афанасьева 2007: Практически курс манийского языка, Часть 1. Ханты-Мансийск. Полиграфист.

Alphabet, sets and definitions

Alphabet

Basic letters:

Letters with diacritical marks: Precomposed letters are used when available, otherwise non-composed ones.

Archiphones

Triggers

%{VO%}:0 Stem ending in vowel other than и ы %{VI%}:0 Stem ending in vowel и or ы %{SYNCH%}:0 Stem with syncope with и, ы, у hard %{SYNCS%}:0 Stem with syncope with и, ы, у soft %{NOSYNCH%}:0 Stem without syncope with и, ы, у hard %{NOSYNCS%}:0 Stem without syncope with и, ы, у soft %{VCH%}:0 Stem ending in single hard consonant %{VCCH%}:0 Stem ending in hard consonant cluster %{VCS%}:0 Stem ending in single soft consonant %{VCCS%}:0 Stem ending in soft consonant cluster

%{VA%}:0 -аӈкве verb %{VU%}:0 -уӈкве verb

Literal quotes and angles

These must be escaped (cf morpheme boundaries below):

Boundaries

%> # for suffix and word end

Morpheme boundaries:

End of alphabet definitions

Sets

Definitions

Rules

RULE: Syncope for ы

тэ̄ӈкве+V+Act+Ind+Prt+ScSg1+OcSg: eat/syödä

RULE: Syncope for и

RULE: Syncope for у

ла̄тыӈ+N+Sg+PxPl1+Nom: language/kieli

vowel lengthening RULE: a becomes long

RULE: ja becomes long

RULE: y becomes long

RULE: long y disappears

Tests:

Tests:

Tests:

павыл+N+Sg+Loc:

Tests:

Tests:

Tests:

Tests:

RULE: Soft sign deletion for и suffixes 2023.02 ансамбль+N+Sg+Loc: band/yhtye ансамбль+N+Pl+Nom: band/yhtye ансамбль+N+Sg+Loc: band/yhtye

Tests:

ха̄ль+N+Sg+Nom+PxSg3

павыл+N+Pl+Nom:

%{аяØ%}:0 deletes а/я after VO vowel stems

%{аяØ%}:а

%{аяØ%}:я

%{уюØ%}:0 deletes а/я after VO vowel stems

%{уюØ%}:у ща̄рыщ+N+Sg+Nom+PxSg1: sea/meri

%{уюØ%}:ю коюӈкве+V+Inf: track/jäljittää, vuottaa

%{аяLong%}:а̄

%{аяLong%}:я̄

%{аяLong%}:0

%{ЭЕLong%}:э̄ аки

%{ЭЕLong%}:э̄

{ЭЕ}:э

{ЭЕ}:е

%{ЫИ%}:ы тотуӈкве+V+Ind+Prs+ScDu1+OcDu:

%{ЫИ%}:и

CONSONANTS

й:0 коюӈкве+V+Inf: **

ха̄ль+N+Sg+PxSg3+Nom

хансуӈкве+V+PrsPrc: write/kirjoittaa

таӈхуӈкве+V+Act+Ind+Prt+ScSg1: __


This (part of) documentation was generated from src/fst/morphology/phonology.twolc