layout | title | description | header-img |
---|---|---|---|
page |
About |
Cycle GAN - Computer Vision @ UIUC |
img/home-bg.jpg |
In this project, the idea is to implement the algorithm developed by Jun-Yan Zhu
, Taesung Park
, Phillip Isola
and Alexei A. Efros
in their paper “Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks”.
<img src= "{{ "/img/utils/paper.jpg " | prepend: site.baseurl }}" style="width: 60%; margin-left: 1%; margin-right: 1%;">This algorithm learns a mapping function
Implement the method proposed in the paper and test it on test and self provided images. Eg. apples to oranges and vice versa, horses to zebras and vice versa, etc.
Implement this algorithm and propose certain modifications to enhance the outputs and also make it work on videos. Eg. the seasonal change video discussed in the introduction.
Although, team members have some basic understanding of neural networks, understanding the underlying network graph for unpaired image to image translation could be challenging. Training the entire dataset could be a mammoth task both computationally and schematically, hence implementing a toy example for unpaired image to image translation is the minimum task for this project.
All three team members are graduate (Masters’) students in the Department of Industrial Engineering with a concentration in Advanced Analytics. Ziyu and Naman are currently enrolled in the CS446: Machine Learning course which requires them to implement various machine learning algorithms on TensorFlow. The course will also be providing them with an introduction to Generative Adversarial Networks (GANs). Harshad has worked on research projects with his advisor that required him to work with Machine Learning algorithms and implement small Neural Networks on Keras. All three members want to advance their knowledge in Deep Learning and Computer Vision as they desire to pursue their careers as Machine Learning Engineers/Data Scientists. The team does not have prior experience in computer vision and deep learning research. This project will provide them with an excellent opportunity to learn about neural networks and GAN’s.
The standard dataset for unpaired image to image translation is available at EECS UC Berkeley’s CycleGAN web page.
This data set have following image clusters :
Data | Date | Size |
---|---|---|
ae_photos.zip | 2017-04-03 22:06 | 10M |
apple2orange.zip | 2017-03-28 13:51 | 75M |
cezanne2photo.zip | 2017-03-28 13:51 | 267M |
cityscapes.zip | 2017-03-29 03:22 | 267M |
facades.zip | 2017-03-29 23:23 | 34M |
horse2zebra.zip | 2017-03-28 13:51 | 111M |
iphone2dslr_flower.zip | 2017-03-30 12:05 | 324M |
maps.zip | 2017-03-26 19:17 | 1.4G |
monet2photo.zip | 2017-03-26 19:17 | 291M |
Summer2winter.zip | 2017-03-26 19:17 | 126M |
ukiyoe2photo.zip | 2017-03-26 19:17 | 279M |
vangogh2photo.zip | 2017-03-26 19:17 | 292M |
Note : The script to download the dataset can be found at Cycle GAN repository and can we used by following command :
sh ./download_dataset.sh <name of dataset>
The team requires access to BlueWaters for training the network as none of the team members have a GPU or access to any cloud computing platforms.