What question did this study set out to answer?

The aim is to enhance large-scale 3D scene reconstruction by balancing quality, speed, and storage efficiency.

May 11, 2026

Efficient Large-Scale Scene Reconstruction via Semantic-Aware Hybrid Representation

Key Points

Developed a semantic-guided adaptive modeling pipeline that integrates multi-view segmentation with the scene mesh.
Introduced a high-performance CUDA-based hybrid renderer that combines mesh rasterization with Gaussian splatting.
Proposed a mesh-guided sampling strategy that adaptively adds Gaussians to recover fine details in under-reconstructed areas.
The approach reduces storage requirements significantly while maintaining comparable or superior visual quality.
Rendering is faster because of the hybrid representation.
Experiments on diverse large-scale datasets showed effective scene partitioning and economical use of Gaussians.
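
As a rough illustration of the semantic-guided partitioning named in the key points, the sketch below fuses per-view segmentation labels onto mesh faces by majority vote and then splits the faces into a mesh-rendered set and a Gaussian-rendered set. Majority voting, the label names, and the helpers `fuse_face_labels` and `partition_faces` are all assumptions made for illustration; the summary does not say how the authors fuse labels.

```python
from collections import Counter

# Hypothetical label set: classes treated as geometrically regular and kept as mesh.
MESH_CLASSES = {"road", "building"}


def fuse_face_labels(observations):
    """Majority-vote the per-view labels observed for each mesh face.

    observations: iterable of (face_id, label) pairs, one pair per view in
    which the face is visible. Returns a dict {face_id: winning_label}.
    """
    votes = {}
    for face_id, label in observations:
        votes.setdefault(face_id, Counter())[label] += 1
    # Labels are sorted first so that ties resolve deterministically
    # (the alphabetically first label among the top-voted ones wins).
    return {
        face_id: max(sorted(counter), key=counter.__getitem__)
        for face_id, counter in votes.items()
    }


def partition_faces(face_labels):
    """Split faces into a mesh-rendered set and a Gaussian-rendered set."""
    mesh_faces = {f for f, label in face_labels.items() if label in MESH_CLASSES}
    gaussian_faces = set(face_labels) - mesh_faces
    return mesh_faces, gaussian_faces


if __name__ == "__main__":
    views = [
        (0, "road"), (0, "road"), (0, "vegetation"),
        (1, "vegetation"), (1, "building"), (1, "vegetation"),
        (2, "building"),
    ]
    labels = fuse_face_labels(views)
    print(partition_faces(labels))
```

In a pipeline like the one described, Gaussians lying on faces assigned to the mesh set would be the candidates for pruning, while faces in the Gaussian set would keep their splats.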

Abstract

Reconstructing large-scale 3D scenes remains challenging due to the need to balance photorealistic quality, real-time rendering, and compact storage. Recent progress in 3D Gaussian Splatting (3DGS) has achieved impressive fidelity and speed, yet its large-scale application suffers from excessive primitive counts, leading to prohibitive storage and rendering costs. To overcome this inefficiency, we introduce a novel semantic-guided hybrid representation that unifies textured meshes and 3D Gaussians in a differentiable framework. The key idea is to leverage meshes for geometrically regular regions such as roads and building facades, while reserving Gaussians for fine, complex details like vegetation. Our method is realized through three key technical contributions. First, we develop a semantic-guided adaptive modeling pipeline that fuses multi-view segmentation onto the scene mesh to robustly partition the scene and prune redundant Gaussians. Second, we introduce a high-performance CUDA-based hybrid renderer that seamlessly combines mesh rasterization with Gaussian splatting, enabling correct occlusion handling and joint optimization of both representations. Finally, we propose a mesh-guided sampling strategy that adaptively adds Gaussians to recover fine details in under-reconstructed areas. Extensive experiments on diverse large-scale datasets demonstrate that our approach significantly reduces storage requirements and accelerates rendering performance while maintaining comparable or superior visual quality.
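
To make the occlusion handling in the hybrid renderer concrete, here is a minimal single-pixel sketch in plain Python. It assumes the mesh is opaque, that Gaussian fragments are blended front to back, and that any fragment behind the rasterized mesh depth is discarded. The function `composite_pixel` and its parameters are hypothetical; the actual renderer is CUDA-based and its details are not given in the abstract.

```python
def composite_pixel(gaussian_frags, mesh_depth, mesh_color,
                    background=(0.0, 0.0, 0.0)):
    """Blend Gaussian fragments with an opaque mesh sample for one pixel.

    gaussian_frags: list of (depth, alpha, (r, g, b)) tuples.
    mesh_depth: depth of the rasterized mesh at this pixel, or None if the
        mesh does not cover the pixel.
    mesh_color: (r, g, b) of the textured mesh sample (ignored if no hit).
    """
    color = [0.0, 0.0, 0.0]
    transmittance = 1.0  # fraction of light still passing through
    # Front-to-back alpha blending over depth-sorted fragments.
    for depth, alpha, rgb in sorted(gaussian_frags, key=lambda f: f[0]):
        if mesh_depth is not None and depth >= mesh_depth:
            break  # every remaining fragment is hidden behind the mesh
        for c in range(3):
            color[c] += transmittance * alpha * rgb[c]
        transmittance *= 1.0 - alpha
    # Whatever light remains reaches the mesh surface, or the background.
    tail = mesh_color if mesh_depth is not None else background
    for c in range(3):
        color[c] += transmittance * tail[c]
    return tuple(color)


if __name__ == "__main__":
    frags = [(1.0, 0.5, (1.0, 0.0, 0.0)), (3.0, 1.0, (0.0, 0.0, 1.0))]
    # The blue fragment at depth 3.0 lies behind the green mesh at depth 2.0.
    print(composite_pixel(frags, 2.0, (0.0, 1.0, 0.0)))
```

Because the mesh contribution is weighted by the remaining transmittance, the blend stays differentiable with respect to both the Gaussian opacities and the mesh color, which is the property joint optimization of the two representations would rely on.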


Cite This Study

李虎森 et al. (Thu,) studied this question.

synapsesocial.com/papers/6a0171473a9f334c28271a34 https://doi.org/10.1109/tvcg.2026.3691600
