Finite state and Constraint Grammar based analysers, proofing tools and other resources
View the project on GitHub giellalt/lang-crk
Meeting Jan 16
Participants: Antti, Conor, Trond - later: Sjur
syllabics conversion
Goal:
TODO
Trond to:
Sjur to:
Conor to:
How to have a speller
go to langs/crk, and issue these commands:
./configure --with-hfst --enable-spellers
make
sudo make install
To update the speller:
To repeat this on Windows:
crk/tools/spellcheckers/fstbased/hfst/crk.zhfst
to-be-added
[http://www.ualberta.ca/~arppe/PlainsCree.html]
The text:
nitêminân nipâw sisone iskwatemihk.
dog+N+Pl1Ex sleeps+V beside+PART door+N+LOC
waniskâw kîkisepâ.
wakes.up+V morning+N
waniskâw ekwa nohtekatew.
wakes.up+V and+PART is.hungry+V
wâpahtam ôskanisis wiyâkanihk ekwa mîciw.
sees+V bone+N+DIM bowl+N+LOC and+PART eats+V
ekota-ohci nôhkwâtam wiyâkan.
here.from+PART licks+V bowl+N
keyâpic nohtekatew ekwa kâwe nipâw.
still+PART is.hungry+V and+PART again+PART sleeps+V
Issues to fix:
Working with a productive analyser for now.
Conor has checked files and added verb types. Different verb types are to be checked in.
Subjunctive prefix ê-
In writing:
Solution:
Two prefixes in lexc:
and then to h deletion before %^eh in twolc
The tag is now +Sbj, an alternative is +Conjunct mode, +Cnj, so we could do that.
Decided: Use +Cnj
More words in the dictionary, especially the words of the text.
Input here: comma separated stuff to crkeng/inc/
:
main/words/langs/crkeng/inc/nouns.csv
cat inc/nouns.csv
atim n dog
inini n man
nâpês n boy
apiscacihkos n antelope
mâyatihk n bighorn sheep
atihk n caribou
apisimôsnos n deer
wâwaskêsniw n elk
mistatim n horse
môswa n moose
maskwa n bear
okistatowân n grizzly bear
wâpask n polar bear
sîsîp n duck
môhkomân n knife
sakâw n soup
mîcimâpo n soup
sîsîpâwi n duck egg
wâwi n egg
iskwêw n woman
nipâw v sleep
wâpam v see
<e>
<lg>
<l pos="N">sakâw</l>
</lg>
<mg>
<tg>
<t pos="N">soup</t>
</tg>
</mg>
</e>
Results to be added to
main/words/langs/crkeng/src/N_crkeng.xml
Procedure for updating the dictionary (needed: account on the gtweb machine):
[/dicts/nds/NDSUpdatingDictionaries.html]
(with pikiskwewina or guusaaw as the variable for DICT)
ssh neahtta@gtweb.uit.no
\
with a passwordMany writers do not write ê. The analyser handles both, but we should consistently always write e.g. with macron in the code.