March 3, 2026Open Access

Transforming panoramic images into 3D experiences for online cultural heritage visualization

Key Points

Results demonstrate improved spatial fidelity and cost-efficiency using the computer vision approach, providing an enhanced user experience.
User experiments involving 33 participants evaluated key performance metrics related to the 3D models generated from panoramic images.
Automated depth estimation networks were integrated to streamline the 3D reconstruction process, minimizing technical barriers for users.
This approach supports resource-limited digital heritage projects, potentially transforming cultural heritage presentation and accessibility.

Abstract

Online cultural heritage presentations are transitioning from traditional media to immersive 3D displays, yet high-fidelity on-site 3D modeling faces copyright, security, and preservation constraints. This study proposes a deep learning-based 3D reconstruction method using existing panoramic images, demonstrated through Dunhuang’s Mogao Caves. Our automated workflow employs pre-trained depth estimation networks to generate 3D models, substantially reducing costs and technical barriers. Four techniques—Panoramic Display (M1), Box Projection (M2), Photogrammetry (M3), and Computer Vision (M4)—were integrated into a unified VR platform. User experiments (N = 33) combining spatial behavior tracking and questionnaires evaluated key performance metrics. Results demonstrate that the computer vision approach optimally balances spatial fidelity, cost-efficiency, and accessibility, offering a scalable solution for resource-limited digital heritage projects.

Bookmark

View Full Paper

Cite This Study

Xu et al. (Thu,) studied this question.

synapsesocial.com/papers/69a75db9c6e9836116a27f0e https://doi.org/https://doi.org/10.1038/s40494-025-02239-z

Bookmark

View Full Paper