Home  | Publications | ZZW+26

GS4City: Hierarchical Semantic Gaussian Splatting via City-Model Priors

MCML Authors

Abstract

Recent semantic 3D Gaussian Splatting (3DGS) methods primarily rely on 2D foundation models, often yielding ambiguous boundaries and limited support for structured urban semantics. While city models such as CityGML encode hierarchically organized semantics together with building geometry, these labels cannot be directly mapped to Gaussian primitives. We present GS4City, a hierarchical semantic Gaussian Splatting method that incorporates citymodel priors for urban scene understanding. GS4City derives reliable image-aligned masks from Level of Detail (LoD) 3 CityGML models via two-pass raycasting, explicitly using parent-child relations to validate and recover finegrained facade elements. It then fuses these geometrygrounded masks with foundation model predictions to establish scene-consistent instance correspondences, and learns a compact identity encoding for each Gaussian under joint 2D identity supervision and 3D spatial regularization. Experiments on the TUM2TWIN and Gold Coast datasets show that GS4City effectively incorporates structured building semantics into Gaussian scene representations, outperforming existing 2D-driven semantic 3DGS baselines, including LangSplat and Gaga, by up to 15.8 IoU points in coarse building segmentation and 14.2 mIoU points in fine-grained semantic segmentation. By bridging structured city models and photorealistic Gaussian scene representations, GS4City enables semantically queryable and structure-aware urban reconstruction.

misc ZZW+26


Preprint

Apr. 2026

Authors

Q. Zhang • J. Zhu • O. Wysocki • B. Busam • B. Jutzi

Links

arXiv GitHub

Research Area

 B1 | Computer Vision

BibTeXKey: ZZW+26

Back to Top