Preparing text for TTS
Collect enough text to be read in as training material. The model may be built based on 3-12 hours, a good target is 10 hours speech. This should exual appr. 45000-50000 words. Collecting and especially prepararing the text may take several months.
Keep in mind:
- The text should cover digraph sequences, consonant gradation strings, etc.
- The text should be balanced topic-wise
- It should contain numbers of different types
- It should also contain loan words
Links to text collections
forthcoming.