Konferenzbeitrag

A cross-database comparison of two large German speech databases

Ph@ttSessionz and Deutsch heute are two large German speech databases. They were created for different purposes: Ph@ttSessionz to test Internet-based recordings and to adapt speech recognizers to the voices of adolescent speakers, Deutsch heute to document regional variation of German. The databases differ in their recording technique, the selection of recording locations and speakers, elicitation mode, and data processing. In this paper, we outline how the recordings were performed, how the data was processed and annotated, and how the two databases were imported into a single relational database system. We present acoustical measurements on the digit items of both databases. Our results confirm that the elicitation technique affects the speech produced, that f0 is quite comparable despite different recording procedures, and that large speech technology databases with suitable metadata may well be used for the analysis of regional variation of speech.

A cross-database comparison of two large German speech databases

Urheber*in: Draxler, Christoph; Kleiner, Stefan

Namensnennung - Nicht kommerziell - Keine Bearbeitungen 4.0 International

Sprache
Englisch

Thema
Deutsch
Akustische Phonetik
Gesprochene Sprache
Korpus <Linguistik>
Sprachvariante
Annotation
Metadaten
Germanische Sprachen; Deutsch

Ereignis
Geistige Schöpfung
(wer)
Draxler, Christoph
Kleiner, Stefan
Ereignis
Veröffentlichung
(wer)
London : International Phonetic Association (IPA)
(wann)
2017-03-21

URN
urn:nbn:de:bsz:mh39-59983
Letzte Aktualisierung
06.03.2025, 09:00 MEZ

Datenpartner

Dieses Objekt wird bereitgestellt von:
Leibniz-Institut für Deutsche Sprache - Bibliothek. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.

Objekttyp

  • Konferenzbeitrag

Beteiligte

  • Draxler, Christoph
  • Kleiner, Stefan
  • London : International Phonetic Association (IPA)

Entstanden

  • 2017-03-21

Ähnliche Objekte (12)