GiellaLT Documentation

GiellaLT provides rule-based language technology aimed at minority and indigenous languages

View GiellaLT on GitHub

Page Content


Technical documentation for the Freiburg-Helsinki-Tromsø Speech Corpora for Sámi, Komi and other languages.

This document is meant for project collaborators. The documentation will now and then also be relevant for anyone wanting to use our corpora or the annotation tools created by us.


The Freiburg-Helsinki-Tromsø Corpora are built in collaboration between the Freiburg Research Group for Sámi Studies, Kielipankki in Helsinki, Giellatekno in Tromsø, and members of the respective speech communities.

Our approach is to combine knowledge and tools from the applied fields of Language Documentation (often focussing on ARCHIVING, rather than on annotation) and Language Technology (often focussing on WRITTEN, rather than on spoken language) with the main goal to annotating systematically and making available the largest possible variety of language samples for further corpus-based applied and theoretical research.


Language documentation and description

Pite Sámi Documentation Project (PSDP)


Other (current and former) main collaborators

Kola Sámi Documentation Project (KSDP)


Other (current and former) main collaborators

Izhva Komi Documentation Project (IKDP)

FU-Lab Project Wiki (in Russian/Komi)

Leaders (of corpus work)

Other (current and former) main collaborators

Theoretical linguistics

A Corpus-based microtypology of word order in varieties of Sámi and Kurdish (WOSK)

Leaders (of corpus work)

Documentation Pages

Common documentation for our spoken data archive

Common documentation for our working repository for corpus data

Common documentation for data annotation tools

Common documentation for metadata conventions

Description of GRAID Annotation Conventions (used in WOSK)



21.05.2014 07.05.2014


11.04.2014 18.03.2014 13.01.2014


14.05.2014 30.04.2014 08.04.2014

Documentation of Related Projects in Collaboration with Giellatekno