KIT | KIT-Bibliothek | Impressum | Datenschutz

Accessible data lineage: a scoping review on open-source data lineage platforms

Hariharan, Anuja ORCID iD icon 1; Zhang, Tianren; Motz, Marvin ORCID iD icon 1; Weinhardt, Christof ORCID iD icon 1
1 Institut für Wirtschaftsinformatik und Marketing (IISM), Karlsruher Institut für Technologie (KIT)

Abstract:

In the contemporary landscape of data-driven enterprises, establishing data lineage in data transactions can be challenging yet a necessity, due to emerging compliance laws. While there are several commercial data lineage platforms, organizations are unable to successfully employ data lineage methods in their data ecosystem due to accessibility issues, insufficient information on the underlying lineage method, and lack of information on coverage of data lineage taxonomies. In this work, we conduct a structured scoping review using the PRISMA-ScR guidelines, to analyze to what extent current open-source platforms address aspects of data lineage. We adapted well-known data lineage taxonomies, and summarized which aspects of data lineage are addressed. The scoping review highlights the need for open-source lineage platforms that intelligently deduce lineage where meta-data is not available and further research to support inter-organizational data transactions. We draw insights for future areas of research in data lineage, both for practitioners and researchers.

Zugehörige Institution(en) am KIT Institut für Wirtschaftsinformatik und Marketing (IISM)
Publikationstyp Proceedingsbeitrag
Publikationsdatum 17.12.2024
Sprache Englisch
Identifikator ISBN: 978-1-958200-13-1
KITopen-ID: 1000180392
Erschienen in Proceedings of the 45th International Conference on Information Systems
Veranstaltung 45th International Conference on Information Systems (ICIS 2024), Bangkok, Thailand, 15.12.2024 – 18.12.2024

Seitenaufrufe: 22
seit 24.03.2025
KIT – Die Forschungsuniversität in der Helmholtz-Gemeinschaft
KITopen Landing Page