Buchbeitrag

KoGra-DB: Using MapReduce for language corpora

Linguistic query systems are special purpose IR applications. We present a novel state-of-the-art approach for the efficient exploitation of very large linguistic corpora, combining the advantages of relational database management systems (RDBMS) with the functional MapReduce programming model. Our implementation uses the German DEREKO reference corpus with multi-layer linguistic annotations and several types of text-specific metadata, but the proposed strategy is language-independent and adaptable to large-scale multilingual corpora.

KoGra-DB: Using MapReduce for language corpora

Urheber*in: Schneider, Roman

In copyright

0
/
0

Language
Englisch

Subject
Korpus <Linguistik>
Automatische Sprachanalyse
Sprache

Event
Geistige Schöpfung
(who)
Schneider, Roman
Event
Veröffentlichung
(who)
Bonn-Buschdorf : Köllen
(when)
2018-02-02

URN
urn:nbn:de:bsz:mh39-70363
Last update
06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Object type

  • Buchbeitrag

Associated

  • Schneider, Roman
  • Bonn-Buschdorf : Köllen

Time of origin

  • 2018-02-02

Other Objects (12)