GiellaLT Documentation

GiellaLT provides rule-based language technology aimed at minority and indigenous languages

View GiellaLT on GitHub

Page Content

This page documents conventions, standards and relevant workflows used for the annotation of the Freiburg-Tromsø Corpora according to the GRAID conventions used for the investigation of “A Corpus-based microtypology of word order in varieties of Saami and Kurdish (WOSK)”.


GRAID (Grammatical Relations and Animacy in Discourse) is a set of annotation conventions to be used for the cross-linguistic investigation of grammatical relations. GRAID was developed by Geoffrey Haig (Uni Bamberg) and Stefan Schnell (La Trobe Uni), see the GRAID website.


GRAID annotations are written manually in a special annotation tier, e.g. in ELAN. *Can we produce GRAID annotations (semi-) automatically?


In GRAID syntactic class (parts-of-speech) and syntactic function (direct object, predicate, etc.) are merged into one annotation tier. This seems less usefull. We should rather use separate tiers. If done manually we should also consider using closed vocabularies for inserting the annotation values. But at least the tags for syntactic classes could in principle be provided by using FST. *Create a better data structure for our GRAID-relevant ELAN-annotations!