
LiSSA: Toward Generic Traceability Link Recovery through Retrieval-Augmented Generation

Fuchß, Dominik 1; Hey, Tobias 1; Keim, Jan 1; Liu, Haoyu 1; Ewald, Niklas 1; Thirolf, Tobias 1; Koziolek, Anne 1
1 Institut für Informationssicherheit und Verlässlichkeit (KASTEL), Karlsruher Institut für Technologie (KIT)

Abstract (English):

A multitude of software artifacts must be handled during the development and maintenance of a software system. These artifacts interrelate in multiple, complex ways.
Therefore, many software engineering tasks are enabled, and even empowered, by a clear understanding of artifact interrelationships and by the continued advancement of techniques for automated artifact linking.

However, current approaches to automatic Traceability Link Recovery (TLR) mostly target the links between specific sets of artifacts, such as those between requirements and code.
Fortunately, recent advancements in Large Language Models (LLMs) can enable TLR approaches to achieve broad applicability.
Still, providing the LLMs with the specific information needed to perform TLR is a nontrivial problem.

In this paper, we present LiSSA, a framework that harnesses the performance of LLMs and enhances it through Retrieval-Augmented Generation (RAG).
We empirically evaluate LiSSA on three different TLR tasks: requirements to code, documentation to code, and architecture documentation to architecture models. We compare our approach to state-of-the-art approaches.
...

Affiliated institution(s) at KIT: Institut für Informationssicherheit und Verlässlichkeit (KASTEL)
Publication type: Research report/Preprint
Publication year: 2025
Language: English
Identifier: ISSN 1558-1225
KITopen-ID: 1000178348
HGF program: 46.23.01 (POF IV, LK 01) Methods for Engineering Secure Systems
Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Series: Conference Proceedings
Project information: SFB 1608/1 (DFG, DFG KOORD, SFB 1608)
Keywords: Traceability Link Recovery, Large Language Models, Retrieval-Augmented Generation

Full text
DOI: 10.5445/IR/1000178348
Published on 22 January 2025