Key points are not available for this paper at this time.
Novel-view synthesis with sparse input views is important for practical applications such as AR/VR and autonomous driving. Many works in this field have already integrated depth information into NeRF, utilizing depth priors for assistance in geometric and spatial understanding. However, most existing work tends to either overlook the inaccuracies in depth maps or only handle them roughly, limiting the effectiveness of the synthesis. To address this issue, we propose a depth-guided robust point cloud fusion NeRF for sparse input synthesis. We first construct a point cloud for each input view, with a novel point cloud representation based on learnable matrices and vectors. Then, through an additional lightweight scene fusion network, we fuse the point clouds from each input view to build a point cloud of the entire scene. By optimizing the point cloud representation and scene fusion network, inaccuracies in the depth map can be adjusted and refined, thereby achieving a more precise perception of the overall scene. Each voxel in the scene is determined by referencing the fused point cloud to establish its density and appearance. Experimental results demonstrate that our method outperforms state-of-the-art baselines.
Building similarity graph...
Analyzing shared references across papers
Loading...
Shuai Guo
Qiuwen Wang
Yijie Gao
IEEE Transactions on Circuits and Systems for Video Technology
Shanghai Jiao Tong University
China Mobile (China)
State Key Laboratory of Mobile Networks and Mobile Multimedia Technology
Building similarity graph...
Analyzing shared references across papers
Loading...
Guo et al. (Mon,) studied this question.
www.synapsesocial.com/papers/68e6fff6b6db643587679a9b — DOI: https://doi.org/10.1109/tcsvt.2024.3385360