Konferenzbeitrag

Detecting the boundaries of sentence-like units on spoken German

Automatic division of spoken language transcripts into sentence-like units is a challenging problem, caused by disfluencies, ungrammatical structures and the lack of punctuation. We present experiments on dividing up German spoken dialogues where we investigate the impact of task setup and data representation, encoding of context information as well as different model architectures for this task.

Detecting the boundaries of sentence-like units on spoken German

Urheber*in: Ruppenhofer, Josef; Rehbein, Ines

Attribution - NonCommercial - ShareAlike 4.0 International

Language
Englisch

Subject
Deutsch
Gesprochene Sprache
Automatische Sprachanalyse
Segmentierung
Satz
Sprache

Event
Geistige Schöpfung
(who)
Ruppenhofer, Josef
Rehbein, Ines
Event
Veröffentlichung
(who)
München [u.a.] : German Society for Computational Linguistics & Language Technology und Friedrich-Alexander-Universität Erlangen-Nürnberg
(when)
2019-10-15

URN
urn:nbn:de:bsz:mh39-93174
Last update
06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Object type

  • Konferenzbeitrag

Associated

  • Ruppenhofer, Josef
  • Rehbein, Ines
  • München [u.a.] : German Society for Computational Linguistics & Language Technology und Friedrich-Alexander-Universität Erlangen-Nürnberg

Time of origin

  • 2019-10-15

Other Objects (12)