Los puntos clave no están disponibles para este artículo en este momento.
Learning matching costs has been shown to be critical to the success of the-of-the-art deep stereo matching methods, in which 3D convolutions are on a 4D feature volume to learn a 3D cost volume. However, this has never been employed for the optical flow task. This is mainly due the significantly increased search dimension in the case of optical flow, ie, a straightforward extension would require dense 4D in order to process a 5D feature volume, which is computationally. This paper proposes a novel solution that is able to bypass the of building a 5D feature volume while still allowing the network to suitable matching costs from data. Our key innovation is to decouple the between 2D displacements and learn the matching costs at each 2D hypothesis independently, ie, displacement-invariant cost. Specifically, we apply the same 2D convolution-based matching net on each 2D displacement hypothesis to learn a 4D cost volume. , we propose a displacement-aware projection layer to scale the learned volume, which reconsiders the correlation between different displacement and mitigates the multi-modal problem in the learned cost volume. cost volume is then projected to optical flow estimation through a 2D-argmin layer. Extensive experiments show that our approach achieves-of-the-art accuracy on various datasets, and outperforms all published flow methods on the Sintel benchmark.
Wang et al. (Wed,) studied this question.