Key points are not available for this paper at this time.
We introduce InverseFaceNet, a deep convolutional inverse rendering framework for faces that jointly estimates facial pose, shape, expression, reflectance and illumination from a single input image. By estimating all parameters from just a single image, advanced editing possibilities on a single face image, such as appearance editing and relighting, become feasible in real time. Most previous learning-based face reconstruction approaches do not jointly recover all dimensions, or are severely limited in terms of visual quality. In contrast, we propose to recover high-quality facial pose, shape, expression, reflectance and illumination using a deep neural network that is trained using a large, synthetically created training corpus. Our approach builds on a novel loss function that measures model-space similarity directly in parameter space and significantly improves reconstruction accuracy. We further propose a self-supervised bootstrapping process in the network training loop, which iteratively updates the synthetic training corpus to better reflect the distribution of real-world imagery. We demonstrate that this strategy outperforms completely synthetically trained networks. Finally, we show high-quality reconstructions and compare our approach to several state-of-the-art approaches.
Building similarity graph...
Analyzing shared references across papers
Loading...
Hyeongwoo Kim
Electronics and Telecommunications Research Institute
Michael Zollhöfer
META Health
Ayush Tewari
University of Cambridge
Technical University of Munich
University of Bath
Max Planck Institute for Informatics
Building similarity graph...
Analyzing shared references across papers
Loading...
Kim et al. (Fri,) studied this question.
synapsesocial.com/papers/6a1723402fcf950e0005b446 — DOI: https://doi.org/10.1109/cvpr.2018.00486
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: