Recovering Realistic Texture in Image Super-resolution by Deep Spatial Feature Transform

CVPR 2018  ·  Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy

Although convolutional neural networks (CNNs) have recently demonstrated high-quality reconstruction for single-image super-resolution (SR), recovering natural and realistic texture remains a challenging problem. In this paper, we show that it is possible to recover textures faithful to semantic classes. In particular, we only need to modulate the features of a few intermediate layers in a single network, conditioned on semantic segmentation probability maps. This is made possible by a novel Spatial Feature Transform (SFT) layer that generates affine transformation parameters for spatial-wise feature modulation. SFT layers can be trained end-to-end together with the SR network using the same loss function. At test time, the network accepts an input image of arbitrary size and generates a high-resolution image in a single forward pass, conditioned on the categorical priors. Our final results show that an SR network equipped with SFT can generate more realistic and visually pleasing textures than the state-of-the-art SRGAN and EnhanceNet.
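To make the SFT idea above concrete, below is a minimal PyTorch sketch of a spatial feature transform layer: a condition derived from segmentation probability maps is mapped by two small convolutional branches to a per-pixel scale (gamma) and shift (beta), which then modulate an intermediate feature map with an affine transform. This is an illustration, not the authors' implementation; the class name, channel sizes, use of 1x1 convolutions, and the residual "+1" on the scale are assumptions chosen for brevity.

```python
import torch
import torch.nn as nn


class SFTLayer(nn.Module):
    """Spatial Feature Transform: affine, spatial-wise feature modulation
    whose parameters (gamma, beta) are predicted from a condition map
    (e.g. derived from semantic segmentation probability maps)."""

    def __init__(self, feat_channels=64, cond_channels=32, hidden_channels=32):
        super().__init__()
        # Two small branches map the condition to a scale map and a shift map.
        self.scale_branch = nn.Sequential(
            nn.Conv2d(cond_channels, hidden_channels, 1),
            nn.LeakyReLU(0.1, inplace=True),
            nn.Conv2d(hidden_channels, feat_channels, 1),
        )
        self.shift_branch = nn.Sequential(
            nn.Conv2d(cond_channels, hidden_channels, 1),
            nn.LeakyReLU(0.1, inplace=True),
            nn.Conv2d(hidden_channels, feat_channels, 1),
        )

    def forward(self, features, condition):
        # features:  (N, feat_channels, H, W) intermediate SR features
        # condition: (N, cond_channels, H, W) condition maps from the prior
        gamma = self.scale_branch(condition)
        beta = self.shift_branch(condition)
        # Affine modulation applied independently at every spatial location.
        return features * (gamma + 1) + beta


# Usage sketch with illustrative tensor shapes.
if __name__ == "__main__":
    sft = SFTLayer()
    feats = torch.randn(1, 64, 24, 24)  # features from an intermediate layer
    cond = torch.randn(1, 32, 24, 24)   # segmentation-derived condition maps
    out = sft(feats, cond)
    print(out.shape)  # torch.Size([1, 64, 24, 24])
```

Because gamma and beta vary per pixel, the modulation can apply different texture statistics to different semantic regions of the same image, which is what lets a single network recover class-appropriate textures.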


Datasets


Introduced in the Paper:

OST300

Used in the Paper:

BSD, Set14, BSD100
Task                   | Dataset               | Model   | Metric | Value | Global Rank
Image Super-Resolution | BSD100 - 4x upscaling | SFT-GAN | PSNR   | 25.33 | #55
Image Super-Resolution | BSD100 - 4x upscaling | SFT-GAN | SSIM   | 0.651 | #53
Image Super-Resolution | Set14 - 4x upscaling  | SFT-GAN | PSNR   | 26.13 | #78
Image Super-Resolution | Set14 - 4x upscaling  | SFT-GAN | SSIM   | 0.694 | #71
