
Navigating the Synthetic Realm: Harnessing Diffusion-Based Models for Laparoscopic Text-to-Image Generation

Allmendinger, Simeon; Hemmer, Patrick; Queisner, Moritz; Sauer, Igor; Müller, Leopold; Jakubik, Johannes; Vössing, Michael; Kühl, Niklas

Abstract (English):

Recent advances in synthetic imaging open up opportunities for obtaining additional data in the field of surgical imaging. This data can provide reliable supplements that support surgical applications and decision-making through computer vision. In particular, the field of image-guided surgery, such as laparoscopic and robotic-assisted surgery, benefits strongly from synthetic image datasets and virtual surgical training methods. Our study presents an intuitive approach for generating synthetic laparoscopic images from short text prompts using diffusion-based generative models. We demonstrate the use of state-of-the-art text-to-image architectures in the context of laparoscopic imaging, specifically the surgical removal of the gallbladder. Results on fidelity and diversity demonstrate that diffusion-based models can acquire knowledge about the style and semantics of image-guided surgery. A validation study with a human assessment survey underlines the realistic nature of our synthetic data: medical personnel detect actual images in a pool with generated images at a false-positive rate of 66%. In addition, an investigation of a state-of-the-art machine learning model for recognizing surgical actions indicates improvements of up to 5.20% when the model is trained with additional generated images. ...


Affiliated institution(s) at KIT: Institut für Wirtschaftsinformatik und Marketing (IISM); Karlsruhe Service Research Institute (KSRI)
Publication type: Proceedings paper
Publication date: 23 August 2024
Language: English
Identifier: ISBN: 978-3-031-63591-5
KITopen-ID: 1000174657
Published in: AI for Health Equity and Fairness : Leveraging AI to Address Social Determinants of Health, Eds.: Arash Shaban-Nejad, Martin Michalowski, Simone Bianco
Event: 8th International Workshop on Health Intelligence (W3PHIAI-24 2024), Vancouver, Canada, 26–27 February 2024
Publisher: Springer International Publishing
Pages: 31-46
Series: Studies in Computational Intelligence (SCI) ; 1164