Konferenzbeitrag

CMC Corpora in DeReKo

We introduce three types of corpora of computer-mediated communication that have recently been compiled at the Institute for the German Language or curated from an external project and included in DeReKo, the German Reference Corpus, namely Wikipedia (discussion) corpora, the Usenet news corpus, and the Dortmund Chat Corpus. The data and corpora have been converted to I5, the TEI customization to represent texts in DeReKo, and are researchable via the web-based IDS corpus research interfaces and in the case of Wikipedia and chat also downloadable from the IDS repository and download server, respectively.

CMC Corpora in DeReKo

Urheber*in: Lüngen, Harald; Kupietz, Marc

Attribution - NonCommercial - NoDerivates 4.0 International

0
/
0

Language
Englisch

Subject
Korpus <Linguistik>
Deutsch
Internet
Wikipedia
UseNet
Sprache

Event
Geistige Schöpfung
(who)
Lüngen, Harald
Kupietz, Marc
Event
Veröffentlichung
(who)
Mannheim : Institut für Deutsche Sprache
(when)
2017-07-05

URN
urn:nbn:de:bsz:mh39-62592
Last update
06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Object type

  • Konferenzbeitrag

Associated

  • Lüngen, Harald
  • Kupietz, Marc
  • Mannheim : Institut für Deutsche Sprache

Time of origin

  • 2017-07-05

Other Objects (12)