This is the official implementation for DR-GAN (IEEE TCSVT'20).
Kang Liao, Chunyu Lin, Yao Zhao, Moncef Gabbouj
Given a radial distortion image capture by wide-angle lens, DR-GAN aims to rectify the distortion and recover the realistic scene.
- First generative framework for the distortion rectification
- One-stage rectification (Compared to previous two-stage rectification: distortion parameter estimation -> rectification)
- Label-free training (Directly learning the mapping between distorted structure and rectified structure)
- Real-time rectification (~66 FPS on NVIDIA GeForce RTX 2080Ti and ~26 FPS on TITAN X)
Our recent work MOWA can solve multiple image warping tasks in a single and unified framework, including image rectangling, distortion rectification, and other practical tasks. Check out more details here!
- Python 3.5.6 (or higher)
- Tensorflow 1.12.0
- Keras 2.2.4
- OpenCV
- numpy
- matplotlib
- scikit-image
git clone https://github.com/KangLiao929/DR-GAN.git
cd DR-GAN/
We synthesize the radial distortion image dataset including the training and test data (dataset), based on the polynomial camera model (four distortion parameters: k1, k2, k3, and k4 involved).
-
Download the pretrained models through the following links (generator), and put them into
weights/
. -
To test images in a folder, you can call
test.py
with the opinion--test_path
and--load_models
. For example:python test.py --test_num 100 --test_path "./DR-GAN/dataset/test/" --load_models "./DR-GAN/weights/generator.h5" --write_path "./DR-GAN/dataset/pre/"
or write / modify
test.sh
according to your own needs, then execute this script as (Linux platform):sh ./test.sh
The visual evaluations will be saved in the folder ./dataset/pre/
.
- Generate the training dataset or download our synthesized dataset into the path
dataset/train/
. - To train DR-GAN, you can call
train.py
with the opinion--train_path
. For example:or write / modifypython train.py --train_path ./DR-GAN/dataset/train/ --batch_size 16 --gpu "0"
train.sh
according to your own needs, then execute this script as:sh ./train.sh
Compared to previous parameter-based methods, our DR-GAN is the first attempt at the generation-based solution and achieves real-time rectification. However, it has the following limitations which could be the possible effort directions and open questions for future works.
- Blurred rectified details. Due to the vanilla skip-connection and many downsampling layers in the generator network, our rectified images suffer from visually unpleasing details. See our solution.
- Lack of camera parameters. Due to directly learning the geometric transformation mapping, DR-GAN does not rely on parameter estimation. However, for other research fields such as camera calibration and SfM, the camera parameters are crucial. See our solution.
- Interpretability. Although the generative model excels at learning the transformation given a pair, it is difficult to explain how does the network perceive different distortions of different images. In general, the distortion features are more implicit than the semantic features, so how GAN can handle the distortion perception and distortion rectification simultaneously? Till now, we have not explored a reasonable explanation for this point.
If you have any suggestions or need to discuss, please feel free to contact me -- [email protected].
If our solution and open questions are useful for your research, please consider citing:
@ARTICLE{liao2020drgan,
author={Liao, Kang and Lin, Chunyu and Zhao, Yao and Gabbouj, Moncef},
journal={IEEE Transactions on Circuits and Systems for Video Technology},
title={DR-GAN: Automatic Radial Distortion Rectification Using Conditional GAN in Real-Time},
year={2020},
volume={30},
number={3},
pages={725-733}}
@inproceedings{yang2021progressively,
title={Progressively Complementary Network for Fisheye Image Rectification Using Appearance Flow},
author={Yang, Shangrong and Lin, Chunyu and Liao, Kang and Zhang, Chunjie and Zhao, Yao},
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages={6348--6357},
year={2021}
}
@article{liao2021deep,
title={A Deep Ordinal Distortion Estimation Approach for Distortion Rectification},
author={Liao, Kang and Lin, Chunyu and Zhao, Yao},
journal={IEEE Transactions on Image Processing},
volume={30},
pages={3362--3375},
year={2021}
}