North Sami Text-to-Speech

Finite state and Constraint Grammar based Text-to-Speech processing

View the project on GitHub giellalt/speech-sme

Page Content

This document is an overview of the work of assembling and editing texts for reading, ie the texts used in recording the voices.

Assembling different types of texts

Considering that we want our end product to be able to read “everything”, the texts must range from formal language to colloqial language. The different styles show different preferences for long words, possessive suffixes and particles, which in turn has different implications for prosody. We need a good mixture of these styles.

Editing texts

The texts that are chosen are not totally authentic. They have been altered to accommodate reading fluency. Some texts have not been proofread properly before publishing: