Image Super-Resolution is the task of converting a low-resolution image to a high-resolution one. In this project, we implemented and improved upon the state-of-the-art ESRGAN model to generate super-resolved images with increased stability and perceptual quality.
We conducted multiple experiments to modify the baseline model, training it on a large dataset. The resulting model produces smooth, realistic images that closely resemble real life and outperform the baseline model.
For our training dataset, we used DIV2K, which has 800 high definition high resolution images collected from the Internet. The low-resolution images are then generated using bicubic interpolation with downsampling factor of 4 and Gaussian blur.
Download the dataset: DIV2K Dataset
- Python 3
- Install the dependencies in requirements.txt
- Clone this github repo.
git clone https://github.com/williamcfrancis/Super-Resolution-with-Improved-ESRGAN.git
cd Super-Resolution-with-Improved-ESRGAN
- Place your own low-resolution images in
./LR
folder. - Pretrained model weights are present in './weights' folder
- Run validate.
python validate.py
- The results are in
./results
folder.
- Python 3
- Install the dependencies in requirements.txt
- Clone this github repo
git clone https://github.com/williamcfrancis/Super-Resolution-with-Improved-ESRGAN.git
cd Super-Resolution-with-Improved-ESRGAN
- Run dataset.py to prepare the datasets.
- Run train
python train.py
The following images show the comparison of the Low resolution images, images generated using our baseline model ESRGAN and our final model. The LR images are on the left, ESRGAN outputs are in the middle and our model outputs are on the right. The major difference between the performance of our model and the baseline is that it generates excessively sharp images which is undesirable as it decreases the perceptual quality of the image. Instead, our model produces smoother images which are more appealing to the human eye and do not look very cartoon-like. The images generated from our model resemble the real life images more closely than the baseline.