Conference paper
Building and Annotating a Corpus of German-Language Newsgroups
Usenet is a large online resource containing user-generated messages (news articles) organised in discussion groups (newsgroups) that cover a wide variety of topics. We describe the download, conversion, and annotation of a comprehensive German news corpus for integration into DeReKo, the German Reference Corpus hosted at the Institut für Deutsche Sprache in Mannheim.
- Language
-
English
- Subject
-
Corpus <linguistics>
Annotation
Linguistics
- Event
-
Intellectual creation
- (who)
-
Schröck, Jasmin
Lüngen, Harald
- Event
-
Publication
- (who)
-
German Society for Computational Linguistics & Language Technology (GSCL)
- (when)
-
2015-11-12
- URN
-
urn:nbn:de:bsz:mh39-43640
- Last update
-
06.03.2025, 9:00 AM CET
Data provider
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.
Object type
- Conference paper
Associated
- Schröck, Jasmin
- Lüngen, Harald
- German Society for Computational Linguistics & Language Technology (GSCL)
Time of origin
- 2015-11-12