Konferenzbeitrag

Modelling Linguistic Data Structures

Linguistic corpora have been annotated by means of SGML-based markup languages for almost 20 years. We can, very roughly, differentiate between three distinct evolutionary stages of markup technologies. (1)Originally, single SGML tree-based document instances were deemed sufficient for the representation of linguistic structures. (2) Linguists began to realize that alternatives and extensions to the traditional model are needed. Formalisms such as, for example, NITE were proposed: the NITE Object Model (NOM) consists of multi-rooted trees. (3) We are now on the threshold of the third evolutionary stage: even NITE's very flexible approach is not suited for all linguistic purposes. As some structures, such as these, cannot be modeled by multi-rooted trees, an even more flexible approach is needed in order to provide a generic annotation format that is able to represent genuinely arbitrary linguistic data structures.

Modelling Linguistic Data Structures

Urheber*in: Wörner, Kai; Witt, Andreas; Rehm, Georg; Dipper, Stefanie

Urheberrechtsschutz

0
/
0

Sprache
Englisch

Thema
Linguistik

Ereignis
Geistige Schöpfung
(wer)
Wörner, Kai
Witt, Andreas
Rehm, Georg
Dipper, Stefanie
Ereignis
Veröffentlichung
(wer)
Montreal : Extreme Markup Languages Conference
(wann)
2015-12-22

URN
urn:nbn:de:bsz:mh39-45173
Letzte Aktualisierung
06.03.2025, 09:00 MEZ

Datenpartner

Dieses Objekt wird bereitgestellt von:
Leibniz-Institut für Deutsche Sprache - Bibliothek. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.

Objekttyp

  • Konferenzbeitrag

Beteiligte

  • Wörner, Kai
  • Witt, Andreas
  • Rehm, Georg
  • Dipper, Stefanie
  • Montreal : Extreme Markup Languages Conference

Entstanden

  • 2015-12-22

Ähnliche Objekte (12)