Konferenzbeitrag

Cost-Sensitive Learning in Answer Extraction

One problem of data-driven answer extraction in open-domain factoid question answering is that the class distribution of labeled training data is fairly imbalanced. In an ordinary training set, there are far more incorrect answers than correct answers. The class-imbalance is, thus, inherent to the classification task. It has a deteriorating effect on the performance of classifiers trained by standard machine learning algorithms. They usually have a heavy bias towards the majority class, i.e. the class which occurs most often in the training set. In this paper, we propose a method to tackle class imbalance by applying some form of cost-sensitive learning which is preferable to sampling. We present a simple but effective way of estimating the misclassification costs on the basis of class distribution. This approach offers three benefits. Firstly, it maintains the distribution of the classes of the labeled training data. Secondly, this form of meta-learning can be applied to a wide range of common learning algorithms. Thirdly, this approach can be easily implemented with the help of state-of-the-art machine learning software.

Urheber*in: Wiegand, Michael; Leidner, Jochen L.; Klakow, Dietrich

Attribution 4.0 International

Language: Englisch

Subject: Computerlinguistik
Information Extraction
Maschinelles Lernen
Natürliche Sprache
Sprache

Event: Geistige Schöpfung

(who): Wiegand, Michael
Leidner, Jochen L.
Klakow, Dietrich

Event: Veröffentlichung

(who): Paris : European Language Resources Association

(when): 2019-02-28

URN: urn:nbn:de:bsz:mh39-85373

Last update: 06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Show original at data provider

Object type

Konferenzbeitrag

Associated

Wiegand, Michael
Leidner, Jochen L.
Klakow, Dietrich
Paris : European Language Resources Association

Time of origin

2019-02-28

Other Objects (12)

Cost-Sensitive Learning in Answer Extraction

Abschlussarbeit (Master)

Event-Based Modelling in Question Answering

Dissertation o. Habilitation

Hybrid Approaches for Sentiment Analysis

Hochschulschrift

Der Einfluss des Bobfahrens auf die Herzfrequenz und den Stoffwechsel

Hochschulschrift

Wirkungen von Schlafentzug und Tagschlaf auf die Befindlichkeit depressiver Patienten

Hochschulschrift

Zur Korrelationspathologie des spätfetalen Thymus : Vergleichende Untersuchungen an Thymus, Nebenniere und Placenta bei perinatalen Todesfällen.

Konferenzbeitrag

Predicate Acquisition for Opinion Holder Extraction. A Data-Intensive Approach

Predicate Acquisition for Opinion Holder Extraction : A Data-Intensive Approach

Konferenzbeitrag

Convolution Kernels for Opinion Holder Extraction

Konferenzbeitrag

The Role of Predicates in Opinion Holder Extraction

Cost-Sensitive Learning in Answer Extraction

Abschlussarbeit (Master)

Event-Based Modelling in Question Answering

Dissertation o. Habilitation

Hybrid Approaches for Sentiment Analysis

Hochschulschrift

Der Einfluss des Bobfahrens auf die Herzfrequenz und den Stoffwechsel

Hochschulschrift

Wirkungen von Schlafentzug und Tagschlaf auf die Befindlichkeit depressiver Patienten

Hochschulschrift

Zur Korrelationspathologie des spätfetalen Thymus : Vergleichende Untersuchungen an Thymus, Nebenniere und Placenta bei perinatalen Todesfällen.

Konferenzbeitrag

Predicate Acquisition for Opinion Holder Extraction. A Data-Intensive Approach

Predicate Acquisition for Opinion Holder Extraction : A Data-Intensive Approach

Konferenzbeitrag

Convolution Kernels for Opinion Holder Extraction

Konferenzbeitrag

The Role of Predicates in Opinion Holder Extraction

Cost-Sensitive Learning in Answer Extraction

Abschlussarbeit (Master)

Event-Based Modelling in Question Answering

Dissertation o. Habilitation

Hybrid Approaches for Sentiment Analysis

Hochschulschrift

Der Einfluss des Bobfahrens auf die Herzfrequenz und den Stoffwechsel

Hochschulschrift

Wirkungen von Schlafentzug und Tagschlaf auf die Befindlichkeit depressiver Patienten

Hochschulschrift

Zur Korrelationspathologie des spätfetalen Thymus : Vergleichende Untersuchungen an Thymus, Nebenniere und Placenta bei perinatalen Todesfällen.

Konferenzbeitrag

Predicate Acquisition for Opinion Holder Extraction. A Data-Intensive Approach

Predicate Acquisition for Opinion Holder Extraction : A Data-Intensive Approach

Konferenzbeitrag

Convolution Kernels for Opinion Holder Extraction

Konferenzbeitrag

The Role of Predicates in Opinion Holder Extraction

Cultural heritage institutions wishing to register will find more information here.

Fields marked * need to be filled in.

Username*

Please enter your username

Email*

Please enter your email address

Please do not fill this field

First name

Last name

Password*

Please enter your password

Confirm password*

Please enter the same password

I have read the terms of use and the privacy policy for the collection of personal data and accept them. *

This field is required.

I would like to subscribe to the newsletter of the Deutsche Digitale Bibliothek. See newsletter subscription info.

Account created

Your "My DDB" account has been successfully created. Before you can log in to your account, you must click the confirmation link in the message we just sent to the email address you provided.

Cost-Sensitive Learning in Answer Extraction

Download

Object Details

Classification and Topics

Contributors, Places and Time

Further information

Data provider

Object type

Associated

Time of origin

Other Objects (12)

Cost-Sensitive Learning in Answer Extraction

Event-Based Modelling in Question Answering

Hybrid Approaches for Sentiment Analysis

Der Einfluss des Bobfahrens auf die Herzfrequenz und den Stoffwechsel

Wirkungen von Schlafentzug und Tagschlaf auf die Befindlichkeit depressiver Patienten

Zur Korrelationspathologie des spätfetalen Thymus : Vergleichende Untersuchungen an Thymus, Nebenniere und Placenta bei perinatalen Todesfällen.

Predicate Acquisition for Opinion Holder Extraction. A Data-Intensive Approach

Predicate Acquisition for Opinion Holder Extraction : A Data-Intensive Approach

Convolution Kernels for Opinion Holder Extraction

Convolution Kernels for Opinion Holder Extraction

The Role of Predicates in Opinion Holder Extraction

The Role of Predicates in Opinion Holder Extraction

Cost-Sensitive Learning in Answer Extraction

Event-Based Modelling in Question Answering

Hybrid Approaches for Sentiment Analysis

Der Einfluss des Bobfahrens auf die Herzfrequenz und den Stoffwechsel

Wirkungen von Schlafentzug und Tagschlaf auf die Befindlichkeit depressiver Patienten

Zur Korrelationspathologie des spätfetalen Thymus : Vergleichende Untersuchungen an Thymus, Nebenniere und Placenta bei perinatalen Todesfällen.

Predicate Acquisition for Opinion Holder Extraction. A Data-Intensive Approach

Predicate Acquisition for Opinion Holder Extraction : A Data-Intensive Approach

Convolution Kernels for Opinion Holder Extraction

Convolution Kernels for Opinion Holder Extraction

The Role of Predicates in Opinion Holder Extraction

The Role of Predicates in Opinion Holder Extraction

Cost-Sensitive Learning in Answer Extraction

Event-Based Modelling in Question Answering

Hybrid Approaches for Sentiment Analysis

Der Einfluss des Bobfahrens auf die Herzfrequenz und den Stoffwechsel

Wirkungen von Schlafentzug und Tagschlaf auf die Befindlichkeit depressiver Patienten

Zur Korrelationspathologie des spätfetalen Thymus : Vergleichende Untersuchungen an Thymus, Nebenniere und Placenta bei perinatalen Todesfällen.

Predicate Acquisition for Opinion Holder Extraction. A Data-Intensive Approach

Predicate Acquisition for Opinion Holder Extraction : A Data-Intensive Approach

Convolution Kernels for Opinion Holder Extraction

Convolution Kernels for Opinion Holder Extraction

The Role of Predicates in Opinion Holder Extraction

The Role of Predicates in Opinion Holder Extraction

Related objects

Reset password