README.md

ESRGAN (Enhanced SRGAN) [Paper] [BasicSR]

? Training codes are in BasicSR repo.

Enhanced Super-Resolution Generative Adversarial Networks

By Xintao Wang, Ke Yu, Shixiang Wu, Jinjin Gu, Yihao Liu, Chao Dong, Yu Qiao, Chen Change Loy

This repo only provides simple testing codes, pretrained models and the network strategy demo.

For full training and testing codes, please refer to BasicSR.

We won the first place in PIRM2018-SR competition (region 3) and got the best perceptual index. The paper is accepted to ECCV2018 PIRM Workshop.

? Add Frequently Asked Questions.

For instance,

How to reproduce your results in the PIRM18-SR Challenge (with low perceptual index)?

How do you get the perceptual index in your ESRGAN paper?

BibTeX

@InProceedings{wang2018esrgan,
    author = {Wang, Xintao and Yu, Ke and Wu, Shixiang and Gu, Jinjin and Liu, Yihao and Dong, Chao and Qiao, Yu and Loy, Chen Change},
    title = {ESRGAN: Enhanced super-resolution generative adversarial networks},
    booktitle = {The European Conference on Computer Vision Workshops (ECCVW)},
    month = {September},
    year = {2018}
}

The RRDB_PSNR PSNR_oriented model trained with DF2K dataset (a merged dataset with DIV2K and Flickr2K (proposed in EDSR)) is also able to achive high PSNR performance.

Method Training dataset Set5 Set14 BSD100 Urban100 Manga109 SRCNN 291 30.48/0.8628 27.50/0.7513 26.90/0.7101 24.52/0.7221 27.58/0.8555 EDSR DIV2K 32.46/0.8968 28.80/0.7876 27.71/0.7420 26.64/0.8033 31.02/0.9148 RCAN DIV2K 32.63/0.9002 28.87/0.7889 27.77/0.7436 26.82/ 0.8087 31.22/ 0.9173 RRDB(ours) DF2K 32.73/0.9011 28.99/0.7917 27.85/0.7455 27.03/0.8153 31.66/0.9196

Quick Test

Dependencies

Python 3
PyTorch >= 0.4.0
Python packages: pip install numpy opencv-python

Test models

Clone this github repo.

git clone https://github.com/xinntao/ESRGAN
cd ESRGAN

Place your own low-resolution images in ./LR folder. (There are two sample images - baboon and comic).
Download pretrained models from Google Drive or Baidu Drive. Place the models in ./models. We provide two models with high perceptual quality and high PSNR performance (see model list).
Run test. We provide ESRGAN model and RRDB_PSNR model.

python test.py models/RRDB_ESRGAN_x4.pth
python test.py models/RRDB_PSNR_x4.pth

The results are in ./results folder.

Network interpolation demo

You can interpolate the RRDB_ESRGAN and RRDB_PSNR models with alpha in [0, 1].

Run python net_interp.py 0.8, where 0.8 is the interpolation parameter and you can change it to any value in [0,1].
Run python test.py models/interp_08.pth, where models/interp_08.pth is the model path.

Perceptual-driven SR Results

You can download all the resutls from Google Drive. (✔️ included; ➖ not included; ⭕️ TODO)

HR images can be downloaed from BasicSR-Datasets.

Datasets LR ESRGAN SRGAN EnhanceNet CX Set5 ✔️ ✔️ ✔️ ✔️ ⭕️ Set14 ✔️ ✔️ ✔️ ✔️ ⭕️ BSDS100 ✔️ ✔️ ✔️ ✔️ ⭕️ PIRM
(val, test) ✔️ ✔️ ➖ ✔️ ✔️ OST300 ✔️ ✔️ ➖ ✔️ ⭕️ urban100 ✔️ ✔️ ➖ ✔️ ⭕️ DIV2K
(val, test) ✔️ ✔️ ➖ ✔️ ⭕️

ESRGAN

We improve the SRGAN from three aspects:

adopt a deeper model using Residual-in-Residual Dense Block (RRDB) without batch normalization layers.
employ Relativistic average GAN instead of the vanilla GAN.
improve the perceptual loss by using the features before activation.

In contrast to SRGAN, which claimed that deeper models are increasingly difficult to train, our deeper ESRGAN model shows its superior performance with easy training.

Network Interpolation

We propose the network interpolation strategy to balance the visual quality and PSNR.

We show the smooth animation with the interpolation parameters changing from 0 to 1. Interestingly, it is observed that the network interpolation strategy provides a smooth control of the RRDB_PSNR model and the fine-tuned ESRGAN model.

Qualitative Results

PSNR (evaluated on the Y channel) and the perceptual index used in the PIRM-SR challenge are also provided for reference.

Ablation Study

Overall visual comparisons for showing the effects of each component in ESRGAN. Each column represents a model with its configurations in the top. The red sign indicates the main improvement compared with the previous model.

BN artifacts

We empirically observe that BN layers tend to bring artifacts. These artifacts, namely BN artifacts, occasionally appear among iterations and different settings, violating the needs for a stable performance over training. We find that the network depth, BN position, training dataset and training loss have impact on the occurrence of BN artifacts.

Useful techniques to train a very deep network

We find that residual scaling and smaller initialization can help to train a very deep network. More details are in the Supplementary File attached in our paper.

The influence of training patch size

We observe that training a deeper network benefits from a larger patch size. Moreover, the deeper model achieves more improvement (∼0.12dB) than the shallower one (∼0.04dB) since larger model capacity is capable of taking full advantage of larger training patch size. (Evaluated on Set5 dataset with RGB channels.)

GitHub - xinntao/ESRGAN: ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challe...

README.md

ESRGAN (Enhanced SRGAN) [Paper] [BasicSR]

? Training codes are in BasicSR repo.

Enhanced Super-Resolution Generative Adversarial Networks

For full training and testing codes, please refer to BasicSR.

BibTeX

Quick Test

Dependencies

Test models

Network interpolation demo

Perceptual-driven SR Results

ESRGAN

Network Interpolation

Qualitative Results

Ablation Study

BN artifacts

Useful techniques to train a very deep network

The influence of training patch size

Recommend

GitHub - pjialin/py12306: ? 12306 购票助手，支持分布式，多账号，多任务购票

为什么能力强的人，有时候竞争力反而弱？

短视频管理规范100条来了，抖音快手继续“颤抖”

柳传志：以贸养技是迫不得已志存高远也要脚踏实地

马斯克确认特斯拉新跑车能飞：将用火箭推进系统技术

腾讯技术委员会正式成立，地位高于六大事业群

世界首富离婚了，前妻身家有望超越马化腾

GitHub - mtojek/greenwall: Tiny service health dashboard written in Go

蓝标转债我清仓了，接下来看大家发财了。

全球最贵离婚案即将诞生：贝佐斯发妻或成全球女首富

About Joyk