Conference paper

Building and Annotating a Corpus of German-Language Newsgroups

Usenet is a large online resource containing user-generated messages (news articles) organised in discussion groups (newsgroups) which deal with a wide variety of different topics. We describe the download, conversion, and annotation of a comprehensive German news corpus for integration in DeReKo, the German Reference Corpus hosted at the Institut für Deutsche Sprache in Mannheim.

Creator: Schröck, Jasmin; Lüngen, Harald

In copyright

Language
English

Subject
Corpus <Linguistics>
Annotation
Linguistics

Event
Intellectual creation
(who)
Schröck, Jasmin
Lüngen, Harald
Event
Publication
(who)
German Society for Computational Linguistics & Language Technology (GSCL)
(when)
2015-11-12

URN
urn:nbn:de:bsz:mh39-43640
Last update
06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Object type

  • Conference paper

Associated

  • Schröck, Jasmin
  • Lüngen, Harald
  • German Society for Computational Linguistics & Language Technology (GSCL)

Time of origin

  • 2015-11-12
