What does this research mean for the field?

Adapting denoising diffusion probabilistic models for image super-resolution (SR3) produces highly photo-realistic outputs that outperform state-of-the-art GAN methods in human evaluation. Novelty: ClaimNovelty.METHODOLOGICAL. Consensus alignment: ConsensusAlignment.NEUTRAL.

April 15, 2021Open Access

Image Super-Resolution via Iterative Refinement

Key Points

Key points are not available for this paper at this time.

Abstract

We present SR3, an approach to image Super-Resolution via Repeated Refinement. SR3 adapts denoising diffusion probabilistic models to conditional image generation and performs super-resolution through a stochastic denoising process. Inference starts with pure Gaussian noise and iteratively refines the noisy output using a U-Net model trained on denoising at various noise levels. SR3 exhibits strong performance on super-resolution tasks at different magnification factors, on faces and natural images. We conduct human evaluation on a standard 8X face super-resolution task on CelebA-HQ, comparing with SOTA GAN methods. SR3 achieves a fool rate close to 50%, suggesting photo-realistic outputs, while GANs do not exceed a fool rate of 34%. We further show the effectiveness of SR3 in cascaded image generation, where generative models are chained with super-resolution models, yielding a competitive FID score of 11.3 on ImageNet.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Chitwan Saharia

Google (United States)

Jonathan Ho

Queen Mary University of London

William Chan

University of Maryland, Baltimore

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Image Super-Resolution via Iterative Refinement

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study