A LLM-based voice user interface for voice dialogues between user and industrial machines

Mukherjee, Avik; Karande, Abhijit; Häfner, Polina 1; Poonia, Manish Dev; Kimmig, Andreas; Kreuzwieser, Simon; Vlas, Richard; Klar, Matthias; Sykora, Thomas; Grethler, Michael 1
1 Institut für Informationsmanagement im Ingenieurwesen (IMI), Karlsruher Institut für Technologie (KIT)

Abstract:

Recent advancements in Large Language Models (LLMs) have significantly expanded the role of Artificial Intelligence (AI) in manufacturing. One promising area for LLM integration is machine control, particularly for industrial equipment such as milling, drilling, and turning machines. Despite their critical role, these machines often exhibit limitations in user-friendliness, operational flexibility, accessibility, and operator safety, which stem in part from their console-based interfaces. Voice user interfaces (VUIs) mitigate these limitations. The current literature on VUIs highlights challenges such as constrained instruction sets, voice recognition, and reliance on physical machines for testing. The proposed approach addresses these challenges by using LLMs and virtual prototypes of physical machines for intensive training and testing of the VUI. The paper describes this approach, explores application challenges, and identifies key implementation limitations. It also compares the accuracy of five pre-trained transformer-based language models in understanding a set of sample commands that can be issued to the milling machine, giving an indication of how usable pre-trained transformer models are as the core conversational component of the conceptualised VUI.


Publisher's version
DOI: 10.5445/IR/1000184412
Published on 4 September 2025
Affiliated institution(s) at KIT: Institut für Informationsmanagement im Ingenieurwesen (IMI)
Publication type: Journal article
Publication year: 2025
Language: English
Identifier: ISSN: 2212-8271
KITopen-ID: 1000184412
Published in: Procedia CIRP
Publisher: Elsevier
Volume: 134
Pages: 378–383
Indexed in: Scopus, OpenAlex, Dimensions