
Using Large Language Models for Extracting Structured Information From Scientific Texts

Rettenberger, Luca 1; Münker, Marc F. 2; Schutera, Mark 1; Niemeyer, Christof M. 2; Rabe, Kersten S. 2; Reischl, Markus 1
1 Institut für Automation und angewandte Informatik (IAI), Karlsruher Institut für Technologie (KIT)
2 Institut für Biologische Grenzflächen (IBG), Karlsruher Institut für Technologie (KIT)

Abstract:

Extracting structured information from scientific works is challenging as sought parameters or properties are often scattered across lengthy texts. We introduce a novel iterative approach using Large Language Models (LLMs) to automate this process. Our method first condenses scientific literature, preserving essential information in a dense format, then retrieves predefined attributes. As a biomedical application example, our concept is employed to extract experimental parameters for preparing Metal-Organic Frameworks (MOFs) from scientific work to enable complex and information-rich applications in the biotechnology-oriented life sciences. Our open-source method automates extracting information from verbose texts, converting them into structured and easily navigable data. This considerably improves scientific literature research by utilizing the power of LLMs and paves the way for enhanced and faster information extraction from extensive scientific texts.
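
The two stages named in the abstract (condense first, then retrieve predefined attributes) could be sketched roughly as follows. This is an illustrative sketch only, not the authors' open-source implementation: the function names, the prompts, the chunking by paragraph, and the `llm` callable (any function that takes a prompt string and returns a completion string) are all assumptions made for this example.

```python
import json
from typing import Callable, Dict, List

# Hypothetical interface: any function mapping a prompt to a completion.
LLM = Callable[[str], str]


def split_into_chunks(text: str, max_chars: int = 2000) -> List[str]:
    """Group paragraphs into chunks of roughly max_chars characters.

    A single paragraph longer than max_chars is kept whole.
    """
    chunks: List[str] = []
    current = ""
    for para in text.split("\n\n"):
        if current and len(current) + len(para) + 2 > max_chars:
            chunks.append(current)
            current = para
        else:
            current = f"{current}\n\n{para}" if current else para
    if current:
        chunks.append(current)
    return chunks


def condense(text: str, llm: LLM, max_chars: int = 2000) -> str:
    """Stage 1 (assumed form): fold each chunk into a running dense summary."""
    summary = ""
    for chunk in split_into_chunks(text, max_chars):
        prompt = (
            "Condense the text below into a dense summary. "
            "Keep every experimental parameter and property.\n\n"
            f"Summary so far:\n{summary}\n\nNew text:\n{chunk}"
        )
        summary = llm(prompt)
    return summary


def extract_attributes(
    summary: str, attributes: List[str], llm: LLM
) -> Dict[str, object]:
    """Stage 2 (assumed form): ask for the predefined attributes as JSON."""
    prompt = (
        "Return a JSON object with exactly these keys: "
        f"{', '.join(attributes)}. Use null for anything not stated.\n\n"
        f"Text:\n{summary}"
    )
    try:
        parsed = json.loads(llm(prompt))
    except json.JSONDecodeError:
        parsed = {}  # unparseable reply: treat every attribute as missing
    if not isinstance(parsed, dict):
        parsed = {}
    # Keep only the requested keys; missing ones become None.
    return {name: parsed.get(name) for name in attributes}
```

Passing the model in as a plain callable keeps the sketch independent of any particular LLM provider; the attribute list (for MOF preparation this might be entries such as metal, solvent, or temperature, chosen here purely as examples) is supplied by the caller.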


Publisher's version
DOI: 10.5445/IR/1000177529
Published on 18 December 2024
Original publication
DOI: 10.1515/cdbme-2024-2129
Affiliated institution(s) at KIT: Institut für Automation und angewandte Informatik (IAI)
Institut für Biologische Grenzflächen (IBG)
Publication type: Journal article
Publication date: 1 December 2024
Language: English
Identifier: ISSN 2364-5504
KITopen-ID: 1000177529
HGF program: 43.31.02 (POF IV, LK 01) Devices and Applications
Published in: Current Directions in Biomedical Engineering
Publisher: De Gruyter
Volume: 10
Issue: 4
Pages: 526–529