Automating immersive VR scene creation remains a primary research challenge. Existing methods typically rely on complex geometry with post-simplification, resulting in inefficient pipelines or limited realism. In this paper, we introduce ImmerseGen, a novel agent-guided framework for compact and photorealistic world generation that decouples realism from exhaustive geometric modeling. ImmerseGen represents scenes as hierarchical compositions of lightweight geometric proxies with synthesized RGBA textures, facilitating real-time rendering on mobile VR headsets. We propose terrain-conditioned texturing for base world generation, combined with context-aware texturing for scenery, to produce diverse and visually coherent worlds. VLM-based agents employ semantic grid-based analysis for precise asset placement and enrich scenes with multimodal enhancements such as visual dynamics and ambient sound. Experiments and real-time VR applications demonstrate that ImmerseGen achieves superior photorealism, spatial coherence, and rendering efficiency compared to existing methods.
Building similarity graph...
Analyzing shared references across papers
Loading...
Jinyan Yuan
Bangbang Yang
Keke Wang
IEEE Transactions on Visualization and Computer Graphics
Zhejiang University
Building similarity graph...
Analyzing shared references across papers
Loading...
Yuan et al. (Thu,) studied this question.
www.synapsesocial.com/papers/69cf5ede5a333a821460d958 — DOI: https://doi.org/10.1109/tvcg.2026.3679097
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: