Book chapter

Authorship attribution with convolutional neural networks and POS-eliding

We use a convolutional neural network to perform authorship identification on a very homogeneous dataset of scientific publications. In order to investigate the effect of domain biases, we obscure words below a certain frequency threshold, retaining only their POS-tags. This procedure improves test performance due to better generalization on unseen data. Using our method, we are able to predict the authors of scientific publications in the same discipline at levels well above chance.
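The POS-eliding step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the frequency threshold, the tag set, or the tokenization, so the function name `elide_rare_words`, the parameter `min_count`, the lowercased counting, and the Universal-Dependencies-style tags below are all assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter

def elide_rare_words(tagged_docs, min_count):
    """Replace every word seen fewer than min_count times with its POS tag.

    tagged_docs: list of documents, each a list of (word, pos_tag) pairs.
    min_count:   hypothetical frequency threshold (not taken from the paper).
    """
    # Count word frequencies over the whole collection, ignoring case.
    counts = Counter(word.lower() for doc in tagged_docs for word, _ in doc)
    # Keep frequent words as they are; rare words are reduced to their tag.
    return [
        [word if counts[word.lower()] >= min_count else pos for word, pos in doc]
        for doc in tagged_docs
    ]

# Toy input with made-up tags; real input would come from a POS tagger.
docs = [
    [("We", "PRON"), ("use", "VERB"), ("a", "DET"), ("network", "NOUN")],
    [("We", "PRON"), ("train", "VERB"), ("a", "DET"), ("parser", "NOUN")],
]
elided = elide_rare_words(docs, min_count=2)
print(elided[0])  # ['We', 'VERB', 'a', 'NOUN']
```

In this toy example the topic-bearing words ("network", "parser") are reduced to their tags while the frequent function words survive, which is the intuition behind reducing domain bias. In a real experiment the frequencies would presumably be counted on the training data only, so that the test set does not influence which words are kept.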

Creator: Hitschler, Julian; van den Berg, Esther; Rehbein, Ines

License: Attribution 4.0 International (CC BY 4.0)

Language: English

Subject: Authorship
Computational linguistics
Language

Event: Intellectual creation

(who): Hitschler, Julian
van den Berg, Esther
Rehbein, Ines

Event: Publication

(who): Stroudsburg PA, USA : The Association for Computational Linguistics

(when): 2018-10-02

URN: urn:nbn:de:bsz:mh39-80252

Last update: 06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.



Other Objects (12)

Authorship attribution with convolutional neural networks and POS-eliding

Book chapter

Metaphor detection for German poetry

Book chapter

Detecting annotation noise in automatically labelled data

Book chapter

I’ve got a construction looks funny – representing and recovering non-standard constructions in UD

Book chapter

Sprucing up the trees – error detection in treebanks

Book chapter

Who’s in, who’s out? Predicting the inclusiveness or exclusiveness of personal pronouns in parliamentary debates

Dissertation or habilitation thesis

Treebank-Based Grammar Acquisition for German

Conference paper

Data point selection for self-training

Conference paper

POS error detection in automatically annotated corpora

Article

Der Einfluss der Dependenzgrammatik auf die Computerlinguistik

Data point selection for self-training
