Do Large Language Models Contain Software Architectural Knowledge? An Exploratory Case Study with GPT

Soliman, Mohamed; Keim, Jan

Do Large Language Models Contain Software Architectural Knowledge? An Exploratory Case Study with GPT

¹
¹ Institut für Informationssicherheit und Verlässlichkeit (KASTEL), Karlsruher Institut für Technologie (KIT)

Abstract (englisch):

Architectural knowledge (AK) of existing systems is essential for software engineers to make design decisions. Recently, Large Language Models (LLMs) trained on large-scale datasets, including software repositories, have shown promise in embedding knowledge and answering questions. However, LLMs have not been evaluated for their abilities to answer questions about AK, leaving doubts about their accuracy. This paper assesses GPT, a leading LLM, by evaluating its responses' accuracy, quality, and trustworthiness on the AK of the large-scale open-source system HDFS. We conducted an exploratory case study with 14 software engineers who posed questions to GPT and compared its responses to a predefined ground truth. The engineers rated GPT’s answers with moderate quality and trustworthiness. Our findings on GPT´s accuracy indicates moderate recall but lower precision, especially in identifying quality attribute solutions and design rationales. These results suggest that while GPT and similar models can provide initial insights into AK, expert validation remains necessary for reliability. This study underscores LLMs' potential and limitations to discover software AK.

Zugehörige Institution(en) am KIT	Institut für Informationssicherheit und Verlässlichkeit (KASTEL)
Publikationstyp	Proceedingsbeitrag
Publikationsjahr	2025
Sprache	Englisch
Identifikator	KITopen-ID: 1000178724
HGF-Programm	46.23.01 (POF IV, LK 01) Methods for Engineering Secure Systems
Erschienen in	22nd IEEE International Conference on Software Architecture (ICSA 2025)
Veranstaltung	22nd IEEE International Conference on Software Architecture (ICSA 2025), Ottensee, Dänemark, 31.03.2025 – 04.04.2025
Verlag	Institute of Electrical and Electronics Engineers (IEEE)
Schlagwörter	Architecture knowledge, Architecture design decisions, Large Language Models, GPT

KITopen-Download

DOI: 10.5445/IR/1000178724

Frei zugänglich ab 05.04.2025

Export

Statistiken

Seitenaufrufe: 130
seit 05.02.2025

Repository KITopen

Do Large Language Models Contain Software Architectural Knowledge? An Exploratory Case Study with GPT

Abstract (englisch):