
ConExion: Concept Extraction with Large Language Models

Norouzi, Ebrahim 1; Hertling, Sven; Sack, Harald 1
1 Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB), Karlsruher Institut für Technologie (KIT)

Abstract:

In this paper, an approach for concept extraction from documents using pre-trained large language models (LLMs) is presented. Compared with conventional methods that extract keyphrases summarizing the important information discussed in a document, our approach tackles the more challenging task of extracting all present concepts related to the specific domain, not just the important ones. Through comprehensive evaluations on two widely used benchmark datasets, we demonstrate that our method improves the F1 score compared to state-of-the-art techniques. Additionally, we explore the potential of using prompts within these models for unsupervised concept extraction. The extracted concepts are intended to support domain coverage evaluation of ontologies and facilitate ontology learning, highlighting the effectiveness of LLMs in concept extraction tasks. Our source code and datasets are publicly available at https://github.com/ISE-FIZKarlsruhe/concept_extraction.


Publisher's version
DOI: 10.5445/IR/1000184349
Published on 29.08.2025
Scopus
Citations: 1
Affiliated institution(s) at KIT: Institut für Angewandte Informatik und Formale Beschreibungsverfahren (AIFB)
Publication type: Proceedings paper
Publication date: 16.06.2025
Language: English
Identifier: ISSN 1613-0073; KITopen-ID 1000184349
Published in: Joint Proceedings of the ESWC 2025 Workshops and Tutorials co-located with 22nd Extended Semantic Web Conference (ESWC 2025)
Event: 2nd International Workshop on Natural Scientific Language Processing and Research Knowledge Graphs (NSLP 2025), Portorož, Slovenia, 01.06.2025 – 02.06.2025
Publisher: CEUR-WS
Series: CEUR workshop proceedings ; 3977
External relations: Abstract/Full text
Keywords: Concept Extraction, Present Keyphrase Extraction, Large Language Models
Indexed in: Scopus