KIT | KIT-Bibliothek | Impressum | Datenschutz

Privacy Policy Feature Extraction for Direct- to-Consumer Genetic Testing

Faust, Luisa

Abstract (englisch):

Background: Direct-to-consumer genetic testing (DTC-GT) has gained popularity in recent years due to social media marketing. However, while DTC-GT services collect highly sensitive genetic and biological data, their privacy policies are often lengthy,
vague and difficult to understand for consumers, leading to privacy concerns. Existing research deals with the analysis of privacy policies in general but there is no domain-specific, automated tool to extract and assess privacy policy content in DTC-GT. This paper aims to close that gap, presenting a framework for the automated analysis of privacy policies of DTC-GT companies using the Longformer model.
Objective: The study focuses on the extraction of content of DTC-GT privacy policies by dividing them into 22 categories adapted to DTC-GT. Furthermore, an evaluation framework across multiple dimensions is developed to assess the extraction of critical privacy related features.
Methods: The model training consists of three steps, including the already existing pretraining of the Longformer model, an unsupervised domain adaption and a supervised fine-tuning. The unsupervised adaption uses 16,062 health-related privacy policies to provide exposure to domain-specific legal terminology. ... mehr


Verlagsausgabe §
DOI: 10.5445/IR/1000185402
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
Publikationstyp Buchaufsatz
Publikationsjahr 2025
Sprache Englisch
Identifikator KITopen-ID: 1000185621
Erschienen in cii Student Papers 2025. Ed.: A. Sunyaev
Verlag Karlsruher Institut für Technologie (KIT)
Seiten 42-60
Relationen in KITopen
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page