Konferenzbeitrag

Enhancing the quality of metadata by using authority control

The Component MetaData Infrastructure (CMDI) is the dominant framework for describing language resources according to ISO 24622 (ISO/TC 37/SC 4, 2015). Within the CLARIN world, CMDI has become a huge success. The Virtual Language Observatory (VLO) now holds over 800.000 resources, all described with CMDI-based metadata. With the metadata being harvested from about thirty centres, there is a considerable amount of heterogeneity in the data. In part, there is some use of controlled vocabularies to keep data heterogeneity in check, say when describing the type of a resource, or the country the resource is originating from. However, when CMDI data refers to the names of persons or organisations, strings are used in a rather uncontrolled manner. Here, the CMDI community can learn from libraries and archives who maintain standardised lists for all kinds of names. In this paper, we advocate the use of freely available authority files that support the unique identification of persons, organisations, and more. The systematic use of authority records enhances the quality of the metadata, hence improves the faceted browsing experience in the VLO, and also prepares the sharing of CMDI-based metadata with the data in library catalogues.

Enhancing the quality of metadata by using authority control

Urheber*in: Trippel, Thorsten; Zinn, Claus

Attribution 4.0 International

0
/
0

Language
Englisch

Subject
Metadaten
Normung
Normdatei
Bibliothekskatalog
Bibliothek
Datenqualität
Bibliografische Daten
Sprache

Event
Geistige Schöpfung
(who)
Trippel, Thorsten
Zinn, Claus
Event
Veröffentlichung
(who)
Paris : European Language Resources Association (ELRA)
Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
(when)
2022-01-07

URN
urn:nbn:de:bsz:mh39-108572
Last update
06.03.2025, 9:00 AM CET

Data provider

This object is provided by:
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.

Object type

  • Konferenzbeitrag

Associated

  • Trippel, Thorsten
  • Zinn, Claus
  • Paris : European Language Resources Association (ELRA)
  • Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

Time of origin

  • 2022-01-07

Other Objects (12)