
The data set knowledge graph: Creating a linked open data source for data sets

Färber, M. 1; Lamprecht, D. 1
1 Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB), Karlsruher Institut für Technologie (KIT)

Abstract:

Several scholarly knowledge graphs have been proposed to model and analyze the academic landscape. However, although the number of data sets has increased remarkably in recent years, these knowledge graphs do not primarily focus on data sets but rather on associated entities such as publications. Moreover, publicly available data set knowledge graphs do not systematically contain links to the publications in which the data sets are mentioned. In this paper, we present an approach for constructing an RDF knowledge graph that fulfills these mentioned criteria. Our data set knowledge graph, DSKG, is publicly available at http://dskg.org and contains metadata of data sets for all scientific disciplines. To ensure high data quality of the DSKG, we first identify suitable raw data set collections for creating the DSKG. We then establish links between the data sets and publications modeled in the Microsoft Academic Knowledge Graph that mention these data sets. As the author names of data sets can be ambiguous, we develop and evaluate a method for author name disambiguation and enrich the knowledge graph with links to ORCID. Overall, our knowledge graph contains more than 2,000 data sets with associated properties, as well as 814,000 links to 635,000 scientific publications.


Publisher's version
DOI: 10.5445/IR/1000143272
Published on 2 March 2022
Original publication
DOI: 10.1162/qss_a_00161
Scopus citations: 10
Dimensions citations: 16
Affiliated institution(s) at KIT: Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
Publication type: Journal article
Publication year: 2022
Language: English
Identifiers: ISSN: 2641-3337; KITopen-ID: 1000143272
Published in: Quantitative Science Studies
Publisher: MIT Press
Volume: 2
Issue: 4
Pages: 1324-1355
Publication note: Funded by the KIT Publication Fund
Indexed in: Dimensions, Scopus