Konferenzbeitrag

On the role of duration prediction and symbolic representation for the evaluation of synthetic speech

In order to determine priorities for the improvement of timing in synthetic speech this study looks at the role of segmental duration prediction and the role of phonological symbolic representation in listeners' preferences. In perception experiments using German speech synthesis, two standard duration models (Klatt rules and CART) were tested. The input to these models consisted of symbolic strings which were either derived from a database or a text-to-speech system. Results of the perception experiments show that different duration models can only be distinguished when the symbolic string is appropriate. Considering the relative importance of the symbolic representation, "post-lexical" segmental rules were investigated with the outcome that listeners differ in their preferences regarding the degree of segmental reduction. As a conclusion, before fine-tuning the duration prediction, it is important to calculate an appropriate phonological symbolic representation in order to improve timing in synthetic speech.

On the role of duration prediction and symbolic representation for the evaluation of synthetic speech

Urheber*in: Brinckmann, Caren; Trouvain, Jürgen

Urheberrechtsschutz

Sprache
Englisch

Thema
Deutsch
Automatische Sprachproduktion
Lautquantität
Germanische Sprachen; Deutsch

Ereignis
Geistige Schöpfung
(wer)
Brinckmann, Caren
Trouvain, Jürgen
Ereignis
Veröffentlichung
(wer)
Baixas : ISCA
(wann)
2017-12-20

URN
urn:nbn:de:bsz:mh39-68610
Letzte Aktualisierung
06.03.2025, 09:00 MEZ

Datenpartner

Dieses Objekt wird bereitgestellt von:
Leibniz-Institut für Deutsche Sprache - Bibliothek. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.

Objekttyp

  • Konferenzbeitrag

Beteiligte

  • Brinckmann, Caren
  • Trouvain, Jürgen
  • Baixas : ISCA

Entstanden

  • 2017-12-20

Ähnliche Objekte (12)