Konferenzbeitrag

Recent developments in DeReKo

This paper gives an overview of recent developments in the German Reference Corpus DeReKo in terms of growth, maximising relevant corpus strata, metadata, legal issues, and its current and future research interface. Due to the recent acquisition of new licenses, DeReKo has grown by a factor of four in the first half of 2014, mostly in the area of newspaper text, and presently contains over 24 billion word tokens. Other strata, like fictional texts, web corpora, in particular CMC texts, and spoken but conceptually written texts have also increased significantly. We report on the newly acquired corpora that led to the major increase, on the principles and strategies behind our corpus acquisition activities, and on our solutions for the emerging legal, organisational, and technical challenges.

Recent developments in DeReKo

Urheber*in: Kupietz, Marc; Lüngen, Harald

Urheberrechtsschutz

0
/
0

Sprache
Englisch

Thema
Deutsch
Korpus <Linguistik>
Textkorpus

Ereignis
Geistige Schöpfung
(wer)
Kupietz, Marc
Lüngen, Harald
Ereignis
Veröffentlichung
(wer)
Reykjavik : European Language Resources Association (ELRA)
(wann)
2014-10-13

URN
urn:nbn:de:bsz:mh39-31353
Letzte Aktualisierung
06.03.2025, 09:00 MEZ

Datenpartner

Dieses Objekt wird bereitgestellt von:
Leibniz-Institut für Deutsche Sprache - Bibliothek. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.

Objekttyp

  • Konferenzbeitrag

Beteiligte

  • Kupietz, Marc
  • Lüngen, Harald
  • Reykjavik : European Language Resources Association (ELRA)

Entstanden

  • 2014-10-13

Ähnliche Objekte (12)