GiellaLT provides an infrastructure for rule-based language technology aimed at minority and indigenous languages, and streamlines building anything from keyboards to speech technology. Read more about Why. See also How to get started, and our Privacy document.
The following is the beginning of documentation of the different utility regexes found in $GTHOME/gt/common/src/
. To be extended as more is found.
In order to make pos.fst we need a binary tag-pos.fst This goal depends on tag-pos.regex. The way it is done is that all tags except the POS one are deleted.
We want to delete the +TV +IV tags for the generator (and other tags later on. For that we need our tag-deleter.
No documentation yet
No documentation yet
No documentation yet
No documentation yet
This goal is to make a regex for filenames, urls and mail addresses