Book chapter

Automatic classification of Russian texts for didactic purposes

In this paper, we present the results of automatically classifying Russian texts into three levels of difficulty. Our aim is to build a study corpus of Russian from which an L2 student can select texts of a desired complexity. We build on a pilot study in which we classified Russian texts into two levels of difficulty. In the current paper, we apply the classification to an extended corpus of 577 labelled texts. The best-performing combination of features achieves an accuracy of 0.74 within at most one level of difference.
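The abstract does not define how accuracy "within at most one level of difference" is computed. One possible reading is a tolerance-based accuracy, in which a prediction counts as correct when it is at most one level away from the labelled level. The sketch below illustrates that reading only. The function name `accuracy_within`, the integer encoding of the levels as 1 to 3, and the toy labels are all assumptions for illustration and are not taken from the paper.

```python
from typing import Sequence


def accuracy_within(gold: Sequence[int], predicted: Sequence[int], tolerance: int = 1) -> float:
    """Share of texts whose predicted level is at most `tolerance` levels from the gold level."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    if not gold:
        raise ValueError("at least one labelled text is required")
    hits = sum(1 for g, p in zip(gold, predicted) if abs(g - p) <= tolerance)
    return hits / len(gold)


# Hypothetical toy labels (difficulty levels 1-3), NOT data from the paper.
gold_levels = [1, 2, 3, 3, 1]
predicted_levels = [1, 3, 1, 3, 2]

exact = accuracy_within(gold_levels, predicted_levels, tolerance=0)     # exact matches only
adjacent = accuracy_within(gold_levels, predicted_levels, tolerance=1)  # off-by-one also counts
print(f"exact: {exact:.2f}, within one level: {adjacent:.2f}")
```

With three levels, a tolerance of one leaves only confusions between the lowest and the highest level as errors, so the tolerance-based score is never lower than the exact-match score.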

Authors: Batinić, Dolores; Birzer, Sandra; Zinsmeister, Heike

In copyright

Language
English

Subject
Corpus <linguistics>
Foreign language learning
Russian
Automatic language analysis
Language

Event
Intellectual creation
(who)
Batinić, Dolores
Birzer, Sandra
Zinsmeister, Heike
Event
Publication
(who)
Sankt-Peterburg : Izdatel´stvo Sankt-Peterburgskogo gosudarstvennogo universiteta
(when)
2017-10-25

URN
urn:nbn:de:bsz:mh39-66003
Last update
06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Object type

  • Book chapter

Associated

  • Batinić, Dolores
  • Birzer, Sandra
  • Zinsmeister, Heike
  • Sankt-Peterburg : Izdatel´stvo Sankt-Peterburgskogo gosudarstvennogo universiteta

Time of origin

  • 2017-10-25
