GiellaLT provides an infrastructure for rule-based language technology aimed at minority and indigenous languages, and streamlines building anything from keyboards to speech technology.
This document tracks measures to improve the corpus collection and conversion process. See also the sentence alignment page, which covers that specific part of corpus maintenance.