GiellaLT

GiellaLT provides an infrastructure for rule-based language technology aimed at minority and indigenous languages, and streamlines building anything from keyboards to speech technology.

View GiellaLT on GitHub

Page Content

Make or update spellers for Voikko+HFST

Compiling the spellers

The resulting spellers are available both for OpenOffice/LibreOffice, and for command-line tools. The command-line tools and the test bench require that you also build and install libvoikko.

Languages with analysers working to some degree or better are:

North Saami, South Saami, Lule Saami, Inari Saami, Kven, Meänkieli, Komi Zyrian, Erzya Mordvin, Meadow Mari, Hill Mari, Kven, Faroese, Greenlandic, Tundra Nenets.

This is enough to enable hfst-based spell-checkers in LibreOffice (tested).

To build and test:

  1. Compile your language with hfst support (here: kpv as example)
    cd $GTLANGS/lang-kpv
    ./configure --with-hfst --enable-spellers
    make
    sudo make install

(The new speller will then turn up in /usr/local/share/voikko/3/ as kpv.zhfst with today’s date.)

  1. in LibreOffice, open Preferences > Language Settings > Voikko and check that the newly installed language is listed (and thus known to Voikko+LibreOffice).

Using the spellers in OpenOffice / LibreOffice

This can be done with a simple installation of an extesion. The steps are as follows:

That’s it! Your own transducer directly applied as spellers!

CAVEATS!!!

This is Beta quality, at best

Både hfst som ein del av Voikko og hfst-transdusarane våre er langt i frå ferdige! Det kan finnast problem, og det er kjende feil. Alle lingvistiske problem skal meldast til Github Issues (eller rettast direkte), alle tekniske feil med Voikko, HFST eller OOo-tillegget skal meldast til Sjur eller til Libvoikko-e-postlista (http://lists.puimula.org/listinfo/libvoikko).

Known errors:

Kjende forbetringar i høve til Hunspellstavekontrollane våre:

Running the test bench with voikko+hfst

This requires libvoikko. The commands are:

cd $GTHOME/gt
make <TESTTYPE> GTLANG=sme TESTTOOL=vkhfst

where <TESTTYPE> is one of:

or spelltest to run them all.

Using voikko on the command line

This also requires libvoikko. Just type:

voikkospell -s -d se

See man voikkospell for more details and options.

System-wide speller

Eventually, we will get VoikkoSpellService with support for HFST on our computers, which will integrate the voikko speller in the system-wide speller for all languages we compile and install. But at the moment only a single language can be installed, and only in a specific location.