KIT | KIT-Bibliothek | Impressum | Datenschutz

Autonomously Detecting Defects in Circular Production Using Multimodal Large Language Models and Retrieval Augmentation

Koch, Dominik ORCID iD icon 1; Ohland, Marvin; Wen, Di ORCID iD icon 2; Peng, Kunyu ORCID iD icon 2; Benfer, Martin ORCID iD icon 1; Stiefelhagen, Rainer ORCID iD icon 2; Lanza, Gisela 1
1 Institut für Produktionstechnik (WBK), Karlsruher Institut für Technologie (KIT)
2 Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract:

Automated defect detection is a key enabler for scalable inspection in circular production, where varying component conditions and the scarcity of large-scale annotated datasets challenge conventional supervised vision models. This paper investigates whether multimodal large language models (MLLMs) can serve as a flexible alternative for industrial defect detection without task specific training. To bridge the domain gap of general-purpose models, several retrieval-augmented generation (RAG) architectures are designed to leverage ex pert-defined examples for visual defect detection on used gears in a circular pro duction context. The study compares baseline prompting with text-based, image based, and self-reflective retrieval strategies that dynamically provide context relevant examples and descriptions during inference. The results show that MLLMs already achieve good defect detection accuracy out-of-the-box. Integrat ing RAG increases defect detection accuracy to 95.83% and further improves fine-grained defect classification, with gains of up to 63.94% for specific rare defect types. While the models remain weaker in precise localization and still lack deeper object-level understanding, the results indicate that RAG-enhanced MLLMs are a viable, low-barrier solution for inspection tasks in remanufacturing scenarios with limited expert supervision and diverse object classes. ... mehr


Volltext §
DOI: 10.5445/IR/1000193853
Veröffentlicht am 03.06.2026
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Institut für Produktionstechnik (WBK)
Fakultät für Maschinenbau – Institut für Werkzeugmaschinen und Betriebstechnik (wbk)
Publikationstyp Forschungsbericht/Preprint
Publikationsjahr 2026
Sprache Englisch
Identifikator KITopen-ID: 1000193853
Umfang 10
Bemerkung zur Veröffentlichung Conference on Circular Production Systems and Technologies (CCPT) 2026, Karlsruhe, 8th June 2026
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page