Konferenzbeitrag

Detecting the boundaries of sentence-like units on spoken German

Automatic division of spoken language transcripts into sentence-like units is a challenging problem, caused by disfluencies, ungrammatical structures and the lack of punctuation. We present experiments on dividing up German spoken dialogues where we investigate the impact of task setup and data representation, encoding of context information as well as different model architectures for this task.

Urheber*in: Ruppenhofer, Josef; Rehbein, Ines

Attribution - NonCommercial - ShareAlike 4.0 International

Language: Englisch

Subject: Deutsch
Gesprochene Sprache
Automatische Sprachanalyse
Segmentierung
Satz
Sprache

Event: Geistige Schöpfung

(who): Ruppenhofer, Josef
Rehbein, Ines

Event: Veröffentlichung

(who): München [u.a.] : German Society for Computational Linguistics & Language Technology und Friedrich-Alexander-Universität Erlangen-Nürnberg

(when): 2019-10-15

URN: urn:nbn:de:bsz:mh39-93174

Last update: 06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Show original at data provider

Object type

Konferenzbeitrag

Associated

Ruppenhofer, Josef
Rehbein, Ines
München [u.a.] : German Society for Computational Linguistics & Language Technology und Friedrich-Alexander-Universität Erlangen-Nürnberg

Time of origin

2019-10-15

Other Objects (12)

Konferenzbeitrag

Improving Sentence Boundary Detection for Spoken Language Transcripts

Konferenzbeitrag

A New Resource for German Causal Language

Konferenzbeitrag

There’s no Data like More Data? Revisiting the Impact of Data Size on a Classification Task

Buchbeitrag

Detecting annotation noise in automatically labelled data

Buchbeitrag

I’ve got a construction looks funny – representing and recovering non-standard constructions in UD

Buchbeitrag

Sprucing up the trees – error detection in treebanks

Konferenzbeitrag

Evaluating the Impact of Coder Errors on Active Learning

Konferenzbeitrag

Semantic frames as an anchor representation for sentiment analysis

Konferenzbeitrag

Catching the common cause: extraction and annotation of causal relations and their participants

Konferenzbeitrag

Yes we can!? Annotating the senses of English modal verbs

Konferenzbeitrag

Who is we? Disambiguating the referents of first person plural pronouns in parliamentary debates

Artikel

Is it worth the effort? Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation

Konferenzbeitrag

Improving Sentence Boundary Detection for Spoken Language Transcripts

Konferenzbeitrag

A New Resource for German Causal Language

Konferenzbeitrag

There’s no Data like More Data? Revisiting the Impact of Data Size on a Classification Task

Buchbeitrag

Detecting annotation noise in automatically labelled data

Buchbeitrag

I’ve got a construction looks funny – representing and recovering non-standard constructions in UD

Buchbeitrag

Sprucing up the trees – error detection in treebanks

Konferenzbeitrag

Evaluating the Impact of Coder Errors on Active Learning

Konferenzbeitrag

Semantic frames as an anchor representation for sentiment analysis

Konferenzbeitrag

Catching the common cause: extraction and annotation of causal relations and their participants

Konferenzbeitrag

Yes we can!? Annotating the senses of English modal verbs

Konferenzbeitrag

Who is we? Disambiguating the referents of first person plural pronouns in parliamentary debates

Artikel

Is it worth the effort? Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation

Konferenzbeitrag

Improving Sentence Boundary Detection for Spoken Language Transcripts

Konferenzbeitrag

A New Resource for German Causal Language

Konferenzbeitrag

There’s no Data like More Data? Revisiting the Impact of Data Size on a Classification Task

Buchbeitrag

Detecting annotation noise in automatically labelled data

Buchbeitrag

I’ve got a construction looks funny – representing and recovering non-standard constructions in UD

Buchbeitrag

Sprucing up the trees – error detection in treebanks

Konferenzbeitrag

Evaluating the Impact of Coder Errors on Active Learning

Konferenzbeitrag

Semantic frames as an anchor representation for sentiment analysis

Konferenzbeitrag

Catching the common cause: extraction and annotation of causal relations and their participants

Konferenzbeitrag

Yes we can!? Annotating the senses of English modal verbs

Konferenzbeitrag

Who is we? Disambiguating the referents of first person plural pronouns in parliamentary debates

Artikel

Is it worth the effort? Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation

Cultural heritage institutions wishing to register will find more information here.

Fields marked * need to be filled in.

Username*

Please enter your username

Email*

Please enter your email address

Please do not fill this field

First name

Last name

Password*

Please enter your password

Confirm password*

Please enter the same password

I have read the terms of use and the privacy policy for the collection of personal data and accept them. *

This field is required.

I would like to subscribe to the newsletter of the Deutsche Digitale Bibliothek. See newsletter subscription info.

Account created

Your "My DDB" account has been successfully created. Before you can log in to your account, you must click the confirmation link in the message we just sent to the email address you provided.

Detecting the boundaries of sentence-like units on spoken German

Download

Object Details

Classification and Topics

Contributors, Places and Time

Further information

Data provider

Object type

Associated

Time of origin

Other Objects (12)

Improving Sentence Boundary Detection for Spoken Language Transcripts

A New Resource for German Causal Language

There’s no Data like More Data? Revisiting the Impact of Data Size on a Classification Task

Detecting annotation noise in automatically labelled data

I’ve got a construction looks funny – representing and recovering non-standard constructions in UD

Sprucing up the trees – error detection in treebanks

Evaluating the Impact of Coder Errors on Active Learning

Semantic frames as an anchor representation for sentiment analysis

Catching the common cause: extraction and annotation of causal relations and their participants

Yes we can!? Annotating the senses of English modal verbs

Who is we? Disambiguating the referents of first person plural pronouns in parliamentary debates

Is it worth the effort? Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation

Improving Sentence Boundary Detection for Spoken Language Transcripts

A New Resource for German Causal Language

There’s no Data like More Data? Revisiting the Impact of Data Size on a Classification Task

Detecting annotation noise in automatically labelled data

I’ve got a construction looks funny – representing and recovering non-standard constructions in UD

Sprucing up the trees – error detection in treebanks

Evaluating the Impact of Coder Errors on Active Learning

Semantic frames as an anchor representation for sentiment analysis

Catching the common cause: extraction and annotation of causal relations and their participants

Yes we can!? Annotating the senses of English modal verbs

Who is we? Disambiguating the referents of first person plural pronouns in parliamentary debates

Is it worth the effort? Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation

Improving Sentence Boundary Detection for Spoken Language Transcripts

A New Resource for German Causal Language

There’s no Data like More Data? Revisiting the Impact of Data Size on a Classification Task

Detecting annotation noise in automatically labelled data

I’ve got a construction looks funny – representing and recovering non-standard constructions in UD

Sprucing up the trees – error detection in treebanks

Evaluating the Impact of Coder Errors on Active Learning

Semantic frames as an anchor representation for sentiment analysis

Catching the common cause: extraction and annotation of causal relations and their participants

Yes we can!? Annotating the senses of English modal verbs

Who is we? Disambiguating the referents of first person plural pronouns in parliamentary debates

Is it worth the effort? Assessing the benefits of partial automatic pre-labeling for frame-semantic annotation

Related objects

Reset password