Konferenzbeitrag

Trailblazing through forests of resources in linguistics

Linguistics is facing the challenge of many other sciences as it continues to grow into increasingly complex subfields, each with its own separate or overarching branches. While linguists are certainly aware of the overall structure of the research field, they cannot follow all developments other than those of their subfields. It is thus important to help specialists but also newcomers alike to bushwhack through evolved or unknown territory of linguistic data. A considerable amount of research data in linguistics is described with metadata. While studies described and published in archived journals and conference proceedings receive a quite homogeneous set of metadata tags — e.g., author, title, publisher —, this does not hold for the empirical data and analyses that underlie such studies. Moreover, lexicons, grammars, experimental data, and other types of resources come in different forms; and to make things worse, their description in terms of metadata is also not uniform, if existing at all. These problems are well-known and there are now a number of international initiatives — e.g., CLARIN, FlareNet, MetaNet, DARIAH — to build infrastructures for managing linguistic resources. The NaLiDa project, funded by the German Research Foundation, aims at facilitating the management and access to linguistic resources originating from German research institutions. In cooperation with the German SFB 833 research center, we are developing a combination of faceted and full-text search to give integrated access through heterogeneous metadata sets. Our approach is supported by a central registry for metadata field descriptors, and a component repository for structured groups of data categories as larger building blocks.

Trailblazing through forests of resources in linguistics

Urheber*in: Barkey, Reinhild; Hinrichs, Erhard; Hoppermann, Christina; Trippel, Thorsten; Zinn, Claus

In copyright

Language
Englisch

Subject
Digital Humanities
Forschungsdaten
Metadaten
Datenmanagement
Computerlinguistik
Sprache

Event
Geistige Schöpfung
(who)
Barkey, Reinhild
Hinrichs, Erhard
Hoppermann, Christina
Trippel, Thorsten
Zinn, Claus
Event
Veröffentlichung
(who)
Stanford : Stanford University Library
Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
(when)
2022-02-04

URN
urn:nbn:de:bsz:mh39-109046
Last update
06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Object type

  • Konferenzbeitrag

Associated

  • Barkey, Reinhild
  • Hinrichs, Erhard
  • Hoppermann, Christina
  • Trippel, Thorsten
  • Zinn, Claus
  • Stanford : Stanford University Library
  • Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

Time of origin

  • 2022-02-04

Other Objects (12)