Konferenzbeitrag

Web corpora - the best possible solution for tracking rare phenomena in underresourced languages: clitics in Bosnian, Croatian and Serbian

Complex linguistic phenomena, such as Clitic Climbing in Bosnian, Croatian and Serbian, are often described intuitively, only from the perspective of the main tendency. In this paper, we argue that web corpora currently offer the best source of empirical material for studying Clitic Climbing in BCS. They thus allow the most accurate description of this phenomenon, as less frequent constructions can be tracked only in big, well-annotated data sources. We compare the properties of web corpora for BCS with traditional sources and give examples of studies on CC based on web corpora. Furthermore, we discuss problems related to web corpora and suggest some improvements for the future.

Web corpora - the best possible solution for tracking rare phenomena in underresourced languages: clitics in Bosnian, Croatian and Serbian

Urheber*in: Jurkiewicz-Rohrbacher, Edyta; Kolaković, Zrinka; Hansen, Björn

Attribution - NonCommercial - NoDerivates 4.0 International

Language
Englisch

Subject
Korpus <Linguistik>
Internet
Bosnisch
Serbisch
Kroatisch
Morphem
Sprache

Event
Geistige Schöpfung
(who)
Jurkiewicz-Rohrbacher, Edyta
Kolaković, Zrinka
Hansen, Björn
Event
Veröffentlichung
(who)
Mannheim : Institut für Deutsche Sprache
(when)
2017-07-05

URN
urn:nbn:de:bsz:mh39-62667
Last update
06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Object type

  • Konferenzbeitrag

Associated

  • Jurkiewicz-Rohrbacher, Edyta
  • Kolaković, Zrinka
  • Hansen, Björn
  • Mannheim : Institut für Deutsche Sprache

Time of origin

  • 2017-07-05

Other Objects (12)