Conference paper
POS tagset refinement for linguistic analysis and the impact on statistical parsing
The annotation of parts of speech (POS) in linguistically annotated corpora is a fundamental annotation layer which provides the basis for further syntactic analyses, and many NLP tools rely on POS information as input. However, most POS annotation schemes have been developed with written (newspaper) text in mind and thus do not carry over well to text from other domains and genres. Recent discussions have concentrated on the shortcomings of present POS annotation schemes with regard to their applicability to data from domains other than newspaper text.
- Language
English
- Subject
Corpus <linguistics>
Parts of speech
Syntactic analysis
Annotation
Germanic languages; German
- Event
Intellectual creation
- (who)
Rehbein, Ines
Hirschmann, Hagen
- Event
Publication
- (who)
Tübingen : University of Tübingen
- (when)
2018-10-04
- URN
urn:nbn:de:bsz:mh39-80368
- Last updated
2025-03-06, 09:00 CET
Data partner
Leibniz-Institut für Deutsche Sprache - Bibliothek. For questions about this object, please contact the data partner.
Object type
- Conference paper
Contributors
- Rehbein, Ines
- Hirschmann, Hagen
- Tübingen : University of Tübingen
Created
- 2018-10-04