KIT | KIT-Bibliothek | Impressum | Datenschutz

Do Large Language Models Contain Software Architectural Knowledge? An Exploratory Case Study with GPT

Soliman, Mohamed; Keim, Jan ORCID iD icon 1
1 Institut für Informationssicherheit und Verlässlichkeit (KASTEL), Karlsruher Institut für Technologie (KIT)

Abstract (englisch):

Architectural knowledge (AK) of existing systems is essential for software engineers to make design decisions. Recently, Large Language Models (LLMs) trained on large-scale datasets, including software repositories, have shown promise in embedding knowledge and answering questions. However, LLMs have not been evaluated for their abilities to answer questions about AK, leaving doubts about their accuracy. This paper assesses GPT, a leading LLM, by evaluating its responses' accuracy, quality, and trustworthiness on the AK of the large-scale open-source system HDFS. We conducted an exploratory case study with 14 software engineers who posed questions to GPT and compared its responses to a predefined ground truth. The engineers rated GPT’s answers with moderate quality and trustworthiness. Our findings on GPT´s accuracy indicates moderate recall but lower precision, especially in identifying quality attribute solutions and design rationales. These results suggest that while GPT and similar models can provide initial insights into AK, expert validation remains necessary for reliability. This study underscores LLMs' potential and limitations to discover software AK.


Postprint §
DOI: 10.5445/IR/1000178724
Veröffentlicht am 14.02.2025
Originalveröffentlichung
DOI: 10.1109/ICSA65012.2025.00012
Scopus
Zitationen: 2
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Informationssicherheit und Verlässlichkeit (KASTEL)
Publikationstyp Proceedingsbeitrag
Publikationsdatum 31.03.2025
Sprache Englisch
Identifikator ISBN: 979-8-3315-2091-5
KITopen-ID: 1000178724
HGF-Programm 46.23.01 (POF IV, LK 01) Methods for Engineering Secure Systems
Erschienen in 2025 IEEE 22nd International Conference on Software Architecture (ICSA)
Veranstaltung 22nd IEEE International Conference on Software Architecture (ICSA 2025), Ottensee, Dänemark, 31.03.2025 – 04.04.2025
Verlag Institute of Electrical and Electronics Engineers (IEEE)
Seiten 13–24
Schlagwörter Architecture knowledge, Architecture design decisions, Large Language Models, GPT
Nachgewiesen in Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page