KIT | KIT-Bibliothek | Impressum | Datenschutz

Snap, Segment, Deploy: A Visual Data and Detection Pipeline for Wearable Industrial Assistants

Wen, Di ORCID iD icon 1; Zheng, Junwei 1; Liu, Ruiping 1; Xu, Yi 1; Peng, Kunyu ORCID iD icon 1; Stiefelhagen, Rainer ORCID iD icon 1
1 Institut für Anthropomatik und Robotik (IAR), Karlsruher Institut für Technologie (KIT)

Abstract:

Industrial assembly requires rapid adaptation to complex procedures under constrained computing, connectivity, and privacy conditions, rendering cloud-based solutions impractical. We present an on-device assistant for real-time, semi-hands-free guidance, integrating lightweight detection, speech recognition, and retrieval-augmented response generation. To enable scalable training without manual labeling, we construct the Gear8 dataset via an automated pipeline and introduce a two-stage refinement strategy to enhance robustness against domain shift. Experiments show improved generalization under diverse corruptions. User studies confirm notable gains in efficiency and error reduction, underscoring the system’s suitability for real-world industrial deployment. The Gear8 dataset, models, and code are publicly available at: https://github.com/Kratos-Wen/Gear8.


Originalveröffentlichung
DOI: 10.1109/SMC58881.2025.11342511
Zugehörige Institution(en) am KIT Institut für Anthropomatik und Robotik (IAR)
Publikationstyp Proceedingsbeitrag
Publikationsdatum 05.10.2025
Sprache Englisch
Identifikator ISBN: 979-8-3315-3358-8
ISSN: 1062-922X
KITopen-ID: 1000191910
Erschienen in 2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC)
Veranstaltung IEEE International Conference on Systems, Man, and Cybernetics (SMC 2025), Wien, Österreich, 05.10.2025 – 08.10.2025
Verlag Institute of Electrical and Electronics Engineers (IEEE)
Seiten 1270 - 1276
Nachgewiesen in OpenAlex
Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page