Buchbeitrag

Creating an extensible, levelled study corpus of Russian

zu Verbundenen Objekten

In this paper, we present first results of training a classifier for discriminating Russian texts into different levels of difficulty. For the classification we considered both surface-oriented features adopted from readability assessments and more linguistically informed, positional features to classify texts into two levels of difficulty. This text classification is the main focus of our Levelled Study Corpus of Russian (LeStCoR), in which we aim to build a corpus adapted for language learning purposes – selecting simpler texts for beginner second language learners and more complex texts for advanced learners. The most discriminative feature in our pilot study was a lexical feature that approximates accessibility of the vocabulary by the second language learner in terms of the proportion of familiar words in the texts. The best feature setting achieved an accuracy of 0.91 on a pilot corpus of 209 texts.

Creating an extensible, levelled study corpus of Russian

Urheber*in: Batinić, Dolores; Birzer, Sandra; Zinsmeister, Heike

Urheberrechtsschutz

Sprache: Englisch

Thema: Russisch
Korpus <Linguistik>
Sprache

Ereignis: Geistige Schöpfung

(wer): Batinić, Dolores
Birzer, Sandra
Zinsmeister, Heike

Ereignis: Veröffentlichung

(wer): Bochum : Ruhr-Universität Bochum

(wann): 2017-02-27

URN: urn:nbn:de:bsz:mh39-59235

Letzte Aktualisierung: 06.03.2025, 09:00 MEZ

Datenpartner

Dieses Objekt wird bereitgestellt von:
Leibniz-Institut für Deutsche Sprache - Bibliothek. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.

Original beim Datenpartner anzeigen

Objekttyp

Buchbeitrag

Beteiligte

Batinić, Dolores
Birzer, Sandra
Zinsmeister, Heike
Bochum : Ruhr-Universität Bochum

Entstanden

2017-02-27

Ähnliche Objekte (12)

Creating an extensible, levelled study corpus of Russian

Creating an extensible, levelled study corpus of Russian

Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German

Buchbeitrag

Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German

Creating employment incentives

Buchbeitrag

Creating employment incentives

Corpus REDEWIEDERGABE

Buchbeitrag

Corpus REDEWIEDERGABE

Das Mannheimer Corpus

Buchbeitrag

Das Mannheimer Corpus

Automatic classification of Russian texts for didactic purposes

Buchbeitrag

Automatic classification of Russian texts for didactic purposes

Categories and Paradigms. On Underspecification in Russian Declension

Buchbeitrag

Categories and Paradigms. On Underspecification in Russian Declension

Creating a classroom culture which promotes positive attitudes and motivated learners

Buchbeitrag

Creating a classroom culture which promotes positive attitudes and motivated learners

The implications of English-Russian interactions in mass media

Buchbeitrag

The implications of English-Russian interactions in mass media

Foreign proper names in Russian radio and TV broadcasts

Buchbeitrag

Foreign proper names in Russian radio and TV broadcasts

The language of Russian political discourse and national myth

Buchbeitrag

The language of Russian political discourse and national myth

Creating the lexicon of multi-word expressions for Slovene methodology and structure

Buchbeitrag

Creating the lexicon of multi-word expressions for Slovene methodology and structure

Creating an extensible, levelled study corpus of Russian

Creating an extensible, levelled study corpus of Russian

Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German

Buchbeitrag

Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German

Creating employment incentives

Buchbeitrag

Creating employment incentives

Corpus REDEWIEDERGABE

Buchbeitrag

Corpus REDEWIEDERGABE

Das Mannheimer Corpus

Buchbeitrag

Das Mannheimer Corpus

Automatic classification of Russian texts for didactic purposes

Buchbeitrag

Automatic classification of Russian texts for didactic purposes

Categories and Paradigms. On Underspecification in Russian Declension

Buchbeitrag

Categories and Paradigms. On Underspecification in Russian Declension

Creating a classroom culture which promotes positive attitudes and motivated learners

Buchbeitrag

Creating a classroom culture which promotes positive attitudes and motivated learners

The implications of English-Russian interactions in mass media

Buchbeitrag

The implications of English-Russian interactions in mass media

Foreign proper names in Russian radio and TV broadcasts

Buchbeitrag

Foreign proper names in Russian radio and TV broadcasts

The language of Russian political discourse and national myth

Buchbeitrag

The language of Russian political discourse and national myth

Creating the lexicon of multi-word expressions for Slovene methodology and structure

Buchbeitrag

Creating the lexicon of multi-word expressions for Slovene methodology and structure

Creating an extensible, levelled study corpus of Russian

Creating an extensible, levelled study corpus of Russian

Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German

Buchbeitrag

Technological and methodological challenges in creating, annotating and sharing a learner corpus of spoken German

Creating employment incentives

Buchbeitrag

Creating employment incentives

Corpus REDEWIEDERGABE

Buchbeitrag

Corpus REDEWIEDERGABE

Das Mannheimer Corpus

Buchbeitrag

Das Mannheimer Corpus

Automatic classification of Russian texts for didactic purposes

Buchbeitrag

Automatic classification of Russian texts for didactic purposes

Categories and Paradigms. On Underspecification in Russian Declension

Buchbeitrag

Categories and Paradigms. On Underspecification in Russian Declension

Creating a classroom culture which promotes positive attitudes and motivated learners

Buchbeitrag

Creating a classroom culture which promotes positive attitudes and motivated learners

The implications of English-Russian interactions in mass media

Buchbeitrag

The implications of English-Russian interactions in mass media

Foreign proper names in Russian radio and TV broadcasts

Buchbeitrag

Foreign proper names in Russian radio and TV broadcasts

The language of Russian political discourse and national myth

Buchbeitrag

The language of Russian political discourse and national myth

Creating the lexicon of multi-word expressions for Slovene methodology and structure

Buchbeitrag

Creating the lexicon of multi-word expressions for Slovene methodology and structure

Informationen zur Registrierung von Kultur- und Wissenseinrichtungen finden Sie hier.

Felder mit * müssen ausgefüllt werden.

Benutzername*

Bitte geben Sie Ihren Benutzernamen ein

E-Mail*

Bitte geben Sie Ihre E-Mail ein

Bitte füllen Sie dieses Feld nicht aus

Vorname

Nachname

Passwort*

Bitte geben Sie Ihr Passwort ein

Passwort bestätigen*

Bitte geben Sie das gleiche Passwort ein

Ich habe die Nutzungsbedingungen und die Datenschutzerklärung zur Erhebung persönlicher Daten gelesen und stimme ihnen zu. *

Dieses Feld ist ein Pflichtfeld.

Ich möchte den Newsletter der Deutschen Digitalen Bibliothek abonnieren. Siehe Informationen zum Newsletter-Abonnement.

Benutzerkonto angelegt

Ihr „Meine DDB“-Konto wurde erfolgreich angelegt. Bevor Sie sich in Ihrem Konto anmelden können, müssen Sie auf den Bestätigungslink in der Nachricht klicken, die wir gerade an die von Ihnen angegebene E-Mail-Adresse geschickt haben