GiellaLT

GiellaLT provides an infrastructure for rule-based language technology aimed at minority and indigenous languages, and streamlines building anything from keyboards to speech technology.

Freiburg

Technical documentation for the Freiburg-Helsinki-Tromsø Speech Corpora for Sámi, Komi and other languages.

This document is meant for project collaborators. Parts of it may also be relevant to anyone who wants to use our corpora or the annotation tools we have created.

Intro

The Freiburg-Helsinki-Tromsø Corpora are built in collaboration between the Freiburg Research Group for Sámi Studies, Kielipankki in Helsinki, Giellatekno in Tromsø, and members of the respective speech communities.

Our approach is to combine knowledge and tools from the applied fields of Language Documentation (often focussing on ARCHIVING rather than on annotation) and Language Technology (often focussing on WRITTEN rather than on spoken language). The main goal is to systematically annotate and make available the largest possible variety of language samples for further corpus-based applied and theoretical research.

Projects

Language documentation and description

Pite Sámi Documentation Project (PSDP)

Leader

Joshua Wilbur

Other (current and former) main collaborators

Miriam Hecker (MA student Freiburg)

Kola Sámi Documentation Project (KSDP)

Leader

Michael Rießler

Other (current and former) main collaborators

Maryna Litvak (student assistant Freiburg)
Julia Reitze (MA student Freiburg)
Evgeniya Zhivotova (PhD student Leipzig)

Izhva Komi Documentation Project (IKDP)

FU-Lab Project Wiki (in Russian/Komi)

Leaders (of corpus work)

Michael Rießler
Niko Partanen (PhD student Freiburg)

Other (current and former) main collaborators

Rogier Blokland (project leader Uppsala)
Andrej Chemyshev (FU-Lab Syktyvkar)
Vasili Chuprov (student assistant Syktyvkar)
Marina Fedina (project leader Syktyvkar)
Alexandra Kellner (student assistant Helsinki/Freiburg)
Ënyë Lav (FU-Lab Syktyvkar)

Theoretical linguistics

A Corpus-based microtypology of word order in varieties of Sámi and Kurdish (WOSK)

Leaders (of corpus work)

Michael Rießler
Hanna Thiele (PhD student)

Documentation Pages

Common documentation for our spoken data archive

The Language Archive (TLA)

Common documentation for our working repository for corpus data

Common documentation for data annotation tools

Common documentation for metadata conventions

Metadata

Description of GRAID Annotation Conventions (used in WOSK)

Meetings

General
IKDP
WOSK

Documentation of Related Projects in Collaboration with Giellatekno

Kildin Sámi Lexicography
Komi Lexicography
Oahpa!-nuõrti
Pite Sámi Lexicography
Skolt Sámi Lexicography
