Key points are not available for this paper at this time.
Light field (LF) videos contain not only the spatial-angular information but also the temporal information, which are useful for disparity estimation. The existing work on disparity estimation for LF videos relies on supervised training with disparity labels. To overcome this reliance, we develop an unsupervised disparity estimation framework for LF videos, which consists of a matching branch to perform feature matching and a refinement branch to refine the disparity maps. Our framework also includes a cross-feature fusion module with self-attention and cross-attention to fuse the multi-frame features, and a cost aggregation transformer with cross-depth self-attention blocks to explore the global depth dependencies. Moreover, we propose a left-right consistency strategy to estimate the occlusion regions for the input views and introduce a occlusion-aware photometric loss to solve the occlusion issue. Experimental results demonstrate that our method achieves superior performance compared to the existing supervised and unsupervised methods.
Building similarity graph...
Analyzing shared references across papers
Loading...
Shansi Zhang
China Electronics Technology Group Corporation
Edmund Y. Lam
Education University of Hong Kong
University of Hong Kong
Building similarity graph...
Analyzing shared references across papers
Loading...
Zhang et al. (Mon,) studied this question.
synapsesocial.com/papers/68e73894b6db6435876b1f8f — DOI: https://doi.org/10.1109/icassp48485.2024.10446981