KIT | KIT-Bibliothek | Impressum | Datenschutz

EMUFormer: Efficient Multi-task Uncertainties for Reliable Joint Semantic Segmentation and Monocular Depth Estimation

Landgraf, Steven ORCID iD icon 1; Hillemann, Markus ORCID iD icon 1; Kapler, Theodor 1; Ulrich, Markus ORCID iD icon 1
1 Institut für Photogrammetrie und Fernerkundung (IPF), Karlsruher Institut für Technologie (KIT)

Abstract:

Quantifying the predictive uncertainty emerged as a possible solution to common challenges like overconfidence, lack of explainability, and robustness of deep neural networks, albeit one that is often computationally expensive. Many real-world applications are multi-modal in nature and hence benefit from multi-task learning. In autonomous driving or robotics, for example, the joint solution of semantic segmentation and monocular depth estimation has proven to be valuable. To this end, we introduce EMUFormer, a novel student-teacher distillation approach for efficient multi-task uncertainties in the context of joint semantic segmentation and monocular depth estimation. By leveraging the predictive uncertainties of the teacher, EMUFormer achieves new state-of-the-art results on Cityscapes and NYUv2 and additionally estimates reliable predictive uncertainties for both tasks that are comparable or superior to a Deep Ensemble despite being an order of magnitude more efficient to compute. These findings even extend to out-of-domain and domain adaptation scenarios, highlighting EMUFormer’s remarkable reliability


Verlagsausgabe §
DOI: 10.5445/IR/1000191242
Veröffentlicht am 10.03.2026
Originalveröffentlichung
DOI: 10.1007/s11263-026-02751-0
Cover der Publikation
Zugehörige Institution(en) am KIT Institut für Photogrammetrie und Fernerkundung (IPF)
Publikationstyp Zeitschriftenaufsatz
Publikationsmonat/-jahr 04.2026
Sprache Englisch
Identifikator ISSN: 0920-5691, 1573-1405
KITopen-ID: 1000191242
Erschienen in International Journal of Computer Vision
Verlag Springer
Band 134
Heft 4
Seiten Article no: 142
Vorab online veröffentlicht am 06.03.2026
Schlagwörter Uncertainty Quantification · Semantic Segmentation · Monocular Depth Estimation · Knowledge Distillation ·, Out-of-Domain · Domain adaptation
Nachgewiesen in OpenAlex
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page