GiellaLT provides an infrastructure for rule-based language technology aimed at minority and indigenous languages, and streamlines building anything from keyboards to speech technology.

Corpus Resources

Warning: This page is under construction.

This page contains a dynamically built list of all corpus repositories. Private repositories are not listed.


Grouped according to geography

Languages of the Nordic countries

Languages of Russia

Other European languages

Languages in North America

Languages in Africa

Languages in other parts of the world

Languages with no geography tag

Grouped according to language family

Uralic Languages

Eskimo-Aleut Languages

Algic Languages

Indo-European Languages

Niger-Congo Languages

Turkic Languages

Languages of other language families, isolates, and artificial languages

Languages with no language family tag