This procedure is offered as a workaround while waiting for our hyphenators.
At present (2022) the GiellaLT framework cannot offer hyphenation integrated in its proofing tools. What we can offer, however, is a hyphenation component, either based upon the phonological structure of the language or upon both phonological and morphological cues.
In some cases, e.g. when publishing a book, correct hyphenation becomes important. This procedure shows how a book manuscript can be hyphenated even when the hyphenation tools are not (yet) integrated in the spellcheckers.
We assume that the manuscript is available in plain text format, in a file here called manuscript.txt, and that you have downloaded the lang-xxx catalogue from GitHub (xxx being the ISO code for your language). If so, do the following:
cat manuscript.txt | tr '\-' '‰' | hfst-lookup -q tools/hyphenators/hyphenator-gt-desc.hfstol > hyph-manuscript.txt
Open hyph-manuscript.txt in Microsoft Word
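The hyphen-replacement step in the command above can also be written with sed. This may help on systems where tr works byte by byte (GNU coreutils tr is one such case) and could therefore split the multi-byte ‰ character. The following is only a minimal sketch: the function names protect_hyphens and hyphenate_manuscript are hypothetical and not part of the GiellaLT tools, and the hfst-lookup call is taken unchanged from the command above.

```shell
#!/bin/sh

# Replace every hyphen on standard input with the ‰ placeholder,
# as the tr step does. sed copies the replacement text as it is written,
# so the multi-byte placeholder is emitted whole.
protect_hyphens() {
    sed 's/-/‰/g'
}

# Hypothetical wrapper (not a GiellaLT command): check that the input
# files are readable, then run the same pipeline as above.
#   $1 = manuscript, $2 = hyphenator .hfstol file, $3 = output file
hyphenate_manuscript() {
    manuscript=$1
    hyphenator=$2
    output=$3
    [ -r "$manuscript" ] || { echo "cannot read $manuscript" >&2; return 1; }
    [ -r "$hyphenator" ] || { echo "cannot read $hyphenator" >&2; return 1; }
    protect_hyphens < "$manuscript" | hfst-lookup -q "$hyphenator" > "$output"
}
```

Assuming the functions are loaded in the current shell and you are in the lang-xxx catalogue, a call could look like this: hyphenate_manuscript manuscript.txt tools/hyphenators/hyphenator-gt-desc.hfstol hyph-manuscript.txt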
That’s it! In 7 simple steps (!), you now have a book manuscript with hyphen boundaries exactly where you want to have them.