GiellaLT

GiellaLT provides an infrastructure for rule-based language technology aimed at minority and indigenous languages, and streamlines building anything from keyboards to speech technology. Read more about Why. See also How to get started and our Privacy document.

View GiellaLT on GitHub

Corpus DTD

Introduction

Draft [X] Proposal [ ] Approved [ ]

Version Date Explanation Author
0.1 14.3.2005 Initial transformation from newsgroup Tomi

Version history

Content structure

document (header, body)

    header (fileDesc, sourceDesc)
        fileDesc (title, translatedFrom, genre)
            title
            traslatedFrom (nob|nno|fin|swe|...)
            genre (fact|fiction|religion)
              fact (adm|news|science|...)
              fiction (prose|poetry|...)
              religion (bible|...)

        sourceDesc (author, year, place, publDesc)
            author (name, age, sex, nationality)
            year
            place
            publDesc (publisher, idno)
              idno



    body (text)
    text (title | paragraph | list | table | section) +

        section (title | paragraph | list | table | section) +
        list (li+)
        table (row+)
            row (cell+)

Formal TEI conformant DTD

To be added