Konferenzbeitrag
Enhancing the quality of metadata by using authority control
The Component MetaData Infrastructure (CMDI) is the dominant framework for describing language resources according to ISO 24622 (ISO/TC 37/SC 4, 2015). Within the CLARIN world, CMDI has become a huge success. The Virtual Language Observatory (VLO) now holds over 800.000 resources, all described with CMDI-based metadata. With the metadata being harvested from about thirty centres, there is a considerable amount of heterogeneity in the data. In part, there is some use of controlled vocabularies to keep data heterogeneity in check, say when describing the type of a resource, or the country the resource is originating from. However, when CMDI data refers to the names of persons or organisations, strings are used in a rather uncontrolled manner. Here, the CMDI community can learn from libraries and archives who maintain standardised lists for all kinds of names. In this paper, we advocate the use of freely available authority files that support the unique identification of persons, organisations, and more. The systematic use of authority records enhances the quality of the metadata, hence improves the faceted browsing experience in the VLO, and also prepares the sharing of CMDI-based metadata with the data in library catalogues.
- Language
-
Englisch
- Subject
-
Metadaten
Normung
Normdatei
Bibliothekskatalog
Bibliothek
Datenqualität
Bibliografische Daten
Sprache
- Event
-
Geistige Schöpfung
- (who)
-
Trippel, Thorsten
Zinn, Claus
- Event
-
Veröffentlichung
- (who)
-
Paris : European Language Resources Association (ELRA)
Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
- (when)
-
2022-01-07
- URN
-
urn:nbn:de:bsz:mh39-108572
- Last update
-
06.03.2025, 9:00 AM CET
Data provider
Leibniz-Institut für Deutsche Sprache - Bibliothek. If you have any questions about the object, please contact the data provider.
Object type
- Konferenzbeitrag
Associated
- Trippel, Thorsten
- Zinn, Claus
- Paris : European Language Resources Association (ELRA)
- Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
Time of origin
- 2022-01-07