GiellaLT

GiellaLT provides an infrastructure for rule-based language technology aimed at minority and indigenous languages, and streamlines building anything from keyboards to speech technology.

This page documents the conventions, standards and workflows used to annotate the Freiburg-Tromsø Corpora according to GRAID, as part of the investigation “A Corpus-based microtypology of word order in varieties of Saami and Kurdish (WOSK)”.

Intro

GRAID (Grammatical Relations and Animacy in Discourse) is a set of annotation conventions for the cross-linguistic investigation of grammatical relations. GRAID was developed by Geoffrey Haig (University of Bamberg) and Stefan Schnell (La Trobe University); see the GRAID website.

Questions

GRAID annotations are written manually in a dedicated annotation tier, e.g. in ELAN.
* Can we produce GRAID annotations (semi-)automatically?

Tasks

In GRAID, syntactic class (part of speech) and syntactic function (direct object, predicate, etc.) are merged into a single annotation tier. This seems less useful than keeping them apart, so we should use separate tiers instead. If the annotation is done manually, we should also consider closed vocabularies for entering the annotation values. At least the tags for syntactic class, however, could in principle be supplied by an FST (finite-state transducer).
* Create a better data structure for our GRAID-relevant ELAN annotations!
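One possible shape for such a data structure is sketched below in Python. It is only an illustration under stated assumptions: it assumes a merged gloss looks like `form.animacy:function` (for example `np.h:s`), and the names `FORM_TAGS`, `FUNCTION_TAGS`, `SplitGloss` and `split_gloss` are hypothetical. The two tag sets are small stand-ins for closed vocabularies, not the official GRAID inventory.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical closed vocabularies, for illustration only.
# They are NOT the complete or official GRAID tag inventory.
FORM_TAGS = {"np", "pro", "0", "v", "adp", "other"}
FUNCTION_TAGS = {"s", "a", "p", "obl", "pred", "poss"}


@dataclass(frozen=True)
class SplitGloss:
    """One merged gloss split into values for separate tiers."""
    form: str                 # syntactic class, e.g. "np"
    animacy: Optional[str]    # e.g. "h"; None if absent
    function: Optional[str]   # syntactic function, e.g. "s"; None if absent


def split_gloss(gloss: str) -> SplitGloss:
    """Split an assumed 'form.animacy:function' gloss into its parts.

    Raises ValueError if a part is not in the closed vocabulary.
    """
    head, _, function = gloss.partition(":")
    form, _, animacy = head.partition(".")
    if form not in FORM_TAGS:
        raise ValueError(f"unknown form tag: {form!r}")
    if function and function not in FUNCTION_TAGS:
        raise ValueError(f"unknown function tag: {function!r}")
    return SplitGloss(form, animacy or None, function or None)
```

With this sketch, `split_gloss("np.h:s")` yields a form value, an animacy value and a function value that could each be written to their own tier, while a gloss with a tag outside the vocabularies is rejected instead of being stored silently.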