KIT | KIT-Bibliothek | Impressum | Datenschutz

Evaluating Latent Generative Paradigms for High-Fidelity 3D Shape Completion from a Single Depth Image

Humt, Matthias; Hillenbrand, Ulrich; Triebel, Rudolph 1
1 Fakultät für Informatik (INFORMATIK), Karlsruher Institut für Technologie (KIT)

Abstract:

While generative models have seen significant adoption across a wide range of data modalities, including 3D data, a consensus on which model is best suited for which task has yet to be reached. Further, conditional information such as text and images to steer the generation process are frequently employed, whereas others, like partial 3D data, have not been thoroughly evaluated. In this work, we compare two of the most promising generative modelsDenoising Diffusion Probabilistic Models and Autoregressive Causal Transformers-which we adapt for the tasks of generative shape modeling and completion. We conduct a thorough quantitative evaluation and comparison of both tasks, including a baseline discriminative model and an extensive ablation study. Our results show that (1) the diffusion model with continuous latents outperforms both the discriminative model and the autoregressive approach and delivers state-of-the-art performance on multi-modal shape completion from a single, noisy depth image under realistic conditions and (2) when compared on the same discrete latent space, the autoregressive model can match or exceed diffusion performance on these tasks.


Download
Originalveröffentlichung
DOI: 10.1109/3DV69130.2026.00144
Zugehörige Institution(en) am KIT Fakultät für Informatik (INFORMATIK)
Publikationstyp Proceedingsbeitrag
Publikationsdatum 20.03.2026
Sprache Englisch
Identifikator ISBN: 979-8-3315-7312-6
KITopen-ID: 1000194741
Erschienen in 2026 International Conference on 3D Vision (3DV)
Veranstaltung International Conference on 3D Vision (3DV 2026), Vancouver, Kanada, 20.03.2026 – 23.03.2026
Verlag Institute of Electrical and Electronics Engineers (IEEE)
Seiten 1511 - 1521
Externe Relationen Siehe auch
Schlagwörter 3d shape completion, latent generative models, single-view reconstruction, denoising diffusion probabilistic models, autoregressive transformers
Nachgewiesen in OpenAlex
Scopus
KIT – Die Universität in der Helmholtz-Gemeinschaft
KITopen Landing Page