Konferenzbeitrag

Towards automatic quality assessment of component metadata

Measuring the quality of metadata is only possible by assessing the quality of the underlying schema and the metadata instance. We propose some factors that are measurable automatically for metadata according to the CMD framework, taking into account the variability of schemas that can be defined in this framework. The factors include among others the number of elements, the (re-)use of reusable components, the number of filled in elements. The resulting score can serve as an indicator of the overall quality of the CMD instance, used for feedback to metadata providers or to provide an overview of the overall quality of metadata within a repository. The score is independent of specific schemas and generalizable. An overall assessment of harvested metadata is provided in form of statistical summaries and the distribution, based on a corpus of harvested metadata. The score is implemented in XQuery and can be used in tools, editors and repositories.

Towards automatic quality assessment of component metadata

Urheber*in: Trippel, Thorsten; Broeder, Daan; Durco, Matej; Ohren, Oddrun

Namensnennung - Nicht kommerziell 4.0 International

Sprache
Englisch

Thema
Metadaten
Datenqualität
Dokumentenserver
Datenmanagement
Computerlinguistik
Sprache

Ereignis
Geistige Schöpfung
(wer)
Trippel, Thorsten
Broeder, Daan
Durco, Matej
Ohren, Oddrun
Ereignis
Veröffentlichung
(wer)
Paris : European Language Resources Association (ELRA)
Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)
(wann)
2022-01-11

URN
urn:nbn:de:bsz:mh39-108619
Letzte Aktualisierung
06.03.2025, 09:00 MEZ

Datenpartner

Dieses Objekt wird bereitgestellt von:
Leibniz-Institut für Deutsche Sprache - Bibliothek. Bei Fragen zum Objekt wenden Sie sich bitte an den Datenpartner.

Objekttyp

  • Konferenzbeitrag

Beteiligte

  • Trippel, Thorsten
  • Broeder, Daan
  • Durco, Matej
  • Ohren, Oddrun
  • Paris : European Language Resources Association (ELRA)
  • Mannheim : Leibniz-Institut für Deutsche Sprache (IDS)

Entstanden

  • 2022-01-11

Ähnliche Objekte (12)