Conference paper

On the role of duration prediction and symbolic representation for the evaluation of synthetic speech

To determine priorities for improving timing in synthetic speech, this study examines the role of segmental duration prediction and the role of phonological symbolic representation in listeners' preferences. In perception experiments using German speech synthesis, two standard duration models (Klatt rules and CART) were tested. The input to these models consisted of symbolic strings derived either from a database or from a text-to-speech system. The results of the perception experiments show that the duration models can be distinguished only when the symbolic string is appropriate. Given the relative importance of the symbolic representation, "post-lexical" segmental rules were also investigated, with the outcome that listeners differ in their preferences regarding the degree of segmental reduction. In conclusion, to improve timing in synthetic speech, it is important to compute an appropriate phonological symbolic representation before fine-tuning the duration prediction.

Author: Brinckmann, Caren; Trouvain, Jürgen

In copyright

Language
English

Subject
German
Automatic speech production
Sound quantity
Germanic languages; German

Event
Intellectual creation
(who)
Brinckmann, Caren
Trouvain, Jürgen
Event
Publication
(who)
Baixas : ISCA
(when)
2017-12-20

URN
urn:nbn:de:bsz:mh39-68610
Last update
06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Object type

  • Conference paper

Associated

  • Brinckmann, Caren
  • Trouvain, Jürgen
  • Baixas : ISCA

Time of origin

  • 2017-12-20