KIT | KIT-Bibliothek | Impressum | Datenschutz

GCA-Trans: Global Context-Aware Transformer for Robust Transparent Object Segmentation in Robotic Environments

Li, Deping; Dong, Zujian ; Yang, Zilong 1; Li, Ka-Kui; Huang, Yushen
1 Fakultät für Informatik (INFORMATIK), Karlsruher Institut für Technologie (KIT)

Abstract:

Transparent object segmentation plays a critical role in indoor and outdoor scene understanding, particularly driven by the rapid advancements in autonomous driving and robotics. However, this task presents significant challenges due to the lack of distinct texture and chromatic features in transparent objects, causing their appearance to blend into the background. Existing methods face inherent architectural limitations: CNNs are restricted by limited receptive fields, while Transformer-based methods may inadvertently suppress the weak feature details of transparent surfaces due to the inherent low-pass filtering property of self-attention mechanisms, treating them as background noise. Consequently, these approaches struggle to consistently segment transparent objects across diverse scales, failing to preserve both fine details and large-scale structures. To address these limitations, we propose the Global Context-Aware Transformer (GCA-Trans). Specifically, we design a Multi-scale Context Mining (MCM) module that leverages parallel dilated convolutions with varying receptive fields to simultaneously extract features at multiple scales. ... mehr


Verlagsausgabe §
DOI: 10.5445/IR/1000193791
Veröffentlicht am 05.06.2026
Cover der Publikation
Zugehörige Institution(en) am KIT Fakultät für Informatik (INFORMATIK)
Publikationstyp Zeitschriftenaufsatz
Publikationsjahr 2026
Sprache Englisch
Identifikator ISSN: 2313-433X
KITopen-ID: 1000193791
Erschienen in Journal of Imaging
Verlag MDPI
Band 12
Heft 5
Seiten Art.-Nr.: 212
Vorab online veröffentlicht am 16.05.2026
Externe Relationen Siehe auch
Nachgewiesen in OpenAlex
Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page