GiellaLT

GiellaLT provides an infrastructure for rule-based language technology aimed at minority and indigenous languages, and streamlines building anything from keyboards to speech technology.

View GiellaLT on GitHub

Page Content

  • Installing HFST, our linguistic compiler
  • Some alternative compilers, strictly speaking not needed
  • Installing an editor
  • Now go back to to Getting Started page for the next step towards building, using and developing the linguistic analysers.
  • Getting started with the GiellaLT infrastructure on Linux

    This page is a part of the overall Getting started documentation. It describes what you need to install on Linux to be ready to develop language tools for your language.

    Note that this documentation is relevant when you want to participate in building and developing the grammatical tools yourself. If you only want to use the ready-made grammatical analysers, see the Linguistic analysis page.

    Installing required auxiliary programs

    You need a number of tools for the build chain. Installation differs depending on what Linux distribution you have, Ubuntu, Fedora or Centos:

    Ubuntu (all this in one command)

    sudo apt-get install autoconf automake libtool libsaxonb-java python3-pip \
    python3-lxml  python3-bs4 python3-html5lib libxml-twig-perl antiword xsltproc \
    poppler-utils wget python3-svn wv python3-feedparser subversion openjdk-11-jdk cmake \
    python3-tidylib python3-yaml libxml-libxml-perl libtext-brew-perl icu-devtools
    

    Fedora (18) (this is 3 separate commands)

    sudo yum install autoconf automake libtool saxon python-pip \
    python-lxml python-beautifulsoup4 python3-PyYAML \
    python-unittest2 perl-XML-Twig antiword pysvn wv libxslt poppler python-tidy \
    wget python-feedparser subversion java-1.7.0-openjdk cmake cpan perl-XML-LibXML
    
    sudo pip install pyth pytidylib
    
    sudo cpan install Text::Brew
    

    Centos 6.4 (this is 3 separate commands)

    sudo yum install autoconf automake libtool saxon python-pip \
    python-lxml python-beautifulsoup4 \
    python-unittest2 perl-XML-Twig antiword libxslt poppler \
    wget python-feedparser wv pysvn subversion java-1.7.0-openjdk cmake cpan perl-XML-LibXML
    
    sudo pip install pyth pytidylib
    
    sudo cpan install Text::Brew
    

    Installing HFST, our linguistic compiler

    You need tools to convert your linguistic source code (lexicons, morphology, phonology, syntax, etc.) into usefull tools like analysers, generators, hyphenators and spellers.

    NB! The information below is up-to-date as of HFST 3.16.

    Run ONE OF these sets of commands (ubuntu or fedora, if you installed Linux-on-Windows as part of this documentation, you should choose the ubuntu commands):

    On Linux ubuntu:

    wget https://apertium.projectjj.com/apt/install-nightly.sh -O - | sudo bash
    
    sudo apt-get -f install apertium-all-dev
    

    On Linux fedora (e.g. on gtlab):

    curl https://apertium.projectjj.com/rpm/install-nightly.sh |sudo bash
    
    sudo apt-get -f install apertium-all-devel
    

    This downloads a shell script (1), makes it executable (2), and runs it (3). The shell script in turn will download and install prebuilt binaries for programs for morphology, syntax and machine translation:

    You get the latest version of all required tools in one go, no compilation required! :) Rerun with regular intervals to get the latest updates.

    This is our default compiler, and it builds all our tools. It is open source, and it is needed for turning your morphology and lexicon into spellcheckers and other useful programs.

    Some alternative compilers, strictly speaking not needed

    The following two programs are not needed, we just refer to them since the source code is compatible with them:

    Installing an editor

    You will need a text editor for writing the source files. Here are some suggestions:

    Now go back to to Getting Started page for the next step towards building, using and developing the linguistic analysers.