
GS4City: Hierarchical Semantic Gaussian Splatting via City-Model Priors

Zhang, Qilin; Zhu, Jinyu; Wysocki, Olaf; Busam, Benjamin; Jutzi, Boris 1
1 Institut für Photogrammetrie und Fernerkundung (IPF), Karlsruher Institut für Technologie (KIT)

Abstract:

Recent semantic 3D Gaussian Splatting (3DGS) methods primarily rely on 2D foundation models, often yielding ambiguous boundaries and limited support for structured urban semantics. While city models such as CityGML encode hierarchically organized semantics together with building geometry, these labels cannot be directly mapped to Gaussian primitives. We present GS4City, a hierarchical semantic Gaussian Splatting method that incorporates city-model priors for urban scene understanding. GS4City derives reliable image-aligned masks from Level of Detail (LoD) 3 CityGML models via two-pass raycasting, explicitly using parent-child relations to validate and recover fine-grained facade elements. It then fuses these geometry-grounded masks with foundation-model predictions to establish scene-consistent instance correspondences, and learns a compact identity encoding for each Gaussian under joint 2D identity supervision and 3D spatial regularization. Experiments on the TUM2TWIN and Gold Coast datasets show that GS4City effectively incorporates structured building semantics into Gaussian scene representations, outperforming existing 2D-driven semantic 3DGS baselines, including LangSplat and Gaga, by up to 15.8 IoU points in coarse building segmentation and 14.2 mIoU points in fine-grained semantic segmentation.
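The reported gains are stated in IoU and mIoU points. As a reading aid, the following is a minimal sketch of how these two metrics are commonly computed over per-pixel labels; it is not taken from the paper. The function names `iou` and `mean_iou`, the toy label values, and the choice to skip classes absent from both prediction and ground truth are assumptions made for illustration, and the paper's exact evaluation protocol may differ.

```python
def iou(pred, target, label):
    """Intersection over union of one class over two equal-length label sequences."""
    inter = sum(1 for p, t in zip(pred, target) if p == label and t == label)
    union = sum(1 for p, t in zip(pred, target) if p == label or t == label)
    # None marks a class that appears in neither sequence (assumed convention).
    return inter / union if union else None


def mean_iou(pred, target, labels):
    """Mean of the per-class IoUs, skipping classes absent from both sequences."""
    scores = [s for s in (iou(pred, target, c) for c in labels) if s is not None]
    return sum(scores) / len(scores) if scores else 0.0


# Toy example with hypothetical facade labels: 0 = wall, 1 = window, 2 = door.
target = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]

per_class = {c: iou(pred, target, c) for c in (0, 1, 2)}
miou = mean_iou(pred, target, [0, 1, 2])
```

Assuming "points" refers to percentage points, a difference of 14.2 mIoU points would correspond to a gap of 0.142 on the 0-to-1 scale returned by `mean_iou`.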


Full text
DOI: 10.5445/IR/1000192368
Published on 17.04.2026
Original publication
DOI: 10.48550/arXiv.2604.11401
Affiliated institution(s) at KIT: Institut für Photogrammetrie und Fernerkundung (IPF)
Publication type: Research report/Preprint
Publication year: 2026
Language: English
Identifier: KITopen-ID: 1000192368
Publisher: arXiv
Extent: 10 pp.
Published online in advance on 13.04.2026
Keywords: Computer Vision and Pattern Recognition (cs.CV)
Indexed in: OpenAlex, arXiv