Konferenzbeitrag
Detecting the boundaries of sentence-like units on spoken German
Automatic division of spoken language transcripts into sentence-like units is a challenging problem, caused by disfluencies, ungrammatical structures and the lack of punctuation. We present experiments on dividing up German spoken dialogues where we investigate the impact of task setup and data representation, encoding of context information as well as different model architectures for this task.
- Language
-
Englisch
- Subject
-
Deutsch
Gesprochene Sprache
Automatische Sprachanalyse
Segmentierung
Satz
Sprache
- Event
-
Geistige Schöpfung
- (who)
-
Ruppenhofer, Josef
Rehbein, Ines
- Event
-
Veröffentlichung
- (who)
-
München [u.a.] : German Society for Computational Linguistics & Language Technology und Friedrich-Alexander-Universität Erlangen-Nürnberg
- (when)
-
2019-10-15
- URN
-
urn:nbn:de:bsz:mh39-93174
- Last update
-
06.03.2025, 9:00 AM CET
Data provider
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.
Object type
- Konferenzbeitrag
Associated
- Ruppenhofer, Josef
- Rehbein, Ines
- München [u.a.] : German Society for Computational Linguistics & Language Technology und Friedrich-Alexander-Universität Erlangen-Nürnberg
Time of origin
- 2019-10-15