Kalaallisut NLP Grammar

Finite state and Constraint Grammar based analysers, proofing tools and other resources

View the project on GitHub giellalt/lang-kal

Page Content

Kalaallisut documentation

Maturity Lemma count License Issues Build Status

This page documents the work on the Kalaallisut language model.

Based upon a cooperation between Oqaasileriffik (Grønlands sprogsekretariat) and Divvun & Giellatekno, Oqaasileriffik has built a full-sized Greenlandic analyser.

Central developers have been: Per Langgård, Liv Molich, Nani Nilsen, Trond Trosterud, Paneeraq Nielsen, Anna Karina Madsen, Beatrine Heilman, Judithe Denbæk, Tino Didriksen and Sjur Moshagen.

It is in use in a Greenlandic spellchecker and in a set of pedagogical programs for Greenlandic.

Project documentation

Documentation and information for developers

Here you find the technical documentation of the Greenlandic parser. The parser is built in close cooperation with the Sámi language technology project, with a file structure and setup close to the Sámi ones. General documentation can be found at the Sámi project documentation.

General information for web users

In order to analyse Greenlandic text, you may paste in text in the window here. The machine returns text with all possible analyses. The tags are listed in the document root.lexc linked to below. The parser gives tags for parts of speech, morphosyntactic categories, clitecs, and derivational tags. The derivational affixes have tags carrying the same names as the affixes themselves. In addition to giving all possible tags, the analyser also has a disambiguation mode, that gives only analyses appropriate in the given context.

It is also possible to generate wordforms, by pasting the lemma and gramamtical tags here

In-source documentation

Below is an autogenerated list of documentation pages built from structured comments in the source code. All pages are also concatenated and can be read as one long text here.