Konferenzbeitrag
MLSA – A Multi-layered Reference Corpus for German Sentiment Analysis
In this paper, we describe MLSA, a publicly available multi-layered reference corpus for German-language sentiment analysis. The construction of the corpus is based on the manual annotation of 270 German-language sentences considering three different layers of granularity. The sentence-layer annotation, as the most coarse-grained annotation, focuses on aspects of objectivity, subjectivity and the overall polarity of the respective sentences. Layer 2 is concerned with polarity on the word- and phrase-level, annotating both subjective and factual language. The annotations on Layer 3 focus on the expression-level, denoting frames of private states such as objective and direct speech events. These three layers and their respective annotations are intended to be fully independent of each other. At the same time, exploring for and discovering interactions that may exist between different layers should also be possible. The reliability of the respective annotations was assessed using the average pairwise agreement and Fleiss’ multi-rater measures. We believe that MLSA is a beneficial resource for sentiment analysis research, algorithms and applications that focus on the German language.
- Language
-
Englisch
- Subject
-
Deutsch
Korpus <Linguistik>
Linguistik
- Event
-
Geistige Schöpfung
- (who)
-
Clematide, Simon
Grindl, Stefan
Klenner, Manfred
Petrakis, Stefanos
Remus, Robert
Ruppenhofer, Josef
Waltinger, Ulli
Wiegand, Michael
- Event
-
Veröffentlichung
- (who)
-
European Language Resources Association
- (when)
-
2016-09-02
- URN
-
urn:nbn:de:bsz:mh39-52345
- Last update
-
06.03.2025, 9:00 AM CET
Data provider
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.
Object type
- Konferenzbeitrag
Associated
- Clematide, Simon
- Grindl, Stefan
- Klenner, Manfred
- Petrakis, Stefanos
- Remus, Robert
- Ruppenhofer, Josef
- Waltinger, Ulli
- Wiegand, Michael
- European Language Resources Association
Time of origin
- 2016-09-02