KIT | KIT-Bibliothek | Impressum | Datenschutz

FAIR Digital Object for Accessing Label Information of ML Training Data Stored in a Metadata Schema

Blumenröhr, Nicolas ORCID iD icon 1; Jejkal, Thomas ORCID iD icon 1; Pfeil, Andreas ORCID iD icon 1; Stotzka, Rainer ORCID iD icon 1
1 Scientific Computing Center (SCC), Karlsruher Institut für Technologie (KIT)

Abstract:

In this poster, we introduce how the FAIR¹ Digital Object (FAIR DO) concept can simplify the access to schema-based label information of Machine Learning (ML) training data. Training data sets from heterogeneous sources mostly have different label terms. Therefore, composing them for application in ML comes with the cost of laborious relabeling. To ease this process by automation, the FAIR DO concept can be applied. A FAIR DO is an informative representation of scientific data, e.g. an ML training data set, that makes the data interpretable and actionable for computer
systems. For applicability in the context of ML, a FAIR DO requires at least a globally unique Persistent Identifier (PID), mandatory metadata, and a data type. Label information of a training data set can be described using a proper metadata schema.
With the self-contained structure of a FAIR DO, the associated label information can be accessed. Here we show this structure and explain how it facilitates access to label information. The latter is stored in a document that is based on a custom metadata schema built for ML. ... mehr


Volltext §
DOI: 10.5445/IR/1000146625
Veröffentlicht am 20.05.2022
Cover der Publikation
Zugehörige Institution(en) am KIT Scientific Computing Center (SCC)
Publikationstyp Poster
Publikationsdatum 02.06.2022
Sprache Englisch
Identifikator KITopen-ID: 1000146625
HGF-Programm 46.21.05 (POF IV, LK 01) HMC
Veranstaltung Helmholtz Artificial Intelligence Conference (Helmholtz AI 2022), Dresden, Deutschland, 02.06.2022 – 03.06.2022
Externe Relationen Siehe auch
Schlagwörter FAIR Digital Object, Machine-Learning, Metadata Schema
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page