assignment3

Aug 27, 2016

427bfbe · Aug 27, 2016

Name	Last commit message	Last commit date
cs231n	LSTM_Captioning complete	Aug 16, 2016
.gitignore	RNN_Captioning complete	Aug 15, 2016
ImageGeneration.ipynb	RNN_Captioning complete	Aug 15, 2016
ImageGradients.ipynb	RNN_Captioning complete	Aug 15, 2016
LSTM_Captioning.ipynb	LSTM_Captioning complete	Aug 16, 2016
README.md	Update README.md	Aug 27, 2016
RNN_Captioning.ipynb	LSTM_Captioning complete	Aug 16, 2016
collectSubmission.sh	RNN_Captioning complete	Aug 15, 2016
frameworkpython	RNN_Captioning complete	Aug 15, 2016
kitten.jpg	RNN_Captioning complete	Aug 15, 2016
sky.jpg	RNN_Captioning complete	Aug 15, 2016
start_ipython_osx.sh	RNN_Captioning complete	Aug 15, 2016

README.md

In this assignment you will implement recurrent networks and apply them to image captioning on Microsoft COCO. We will also introduce the TinyImageNet dataset and use a model pretrained on it to explore different applications of image gradients.

The goals of this assignment are as follows:

understand the architecture of recurrent neural networks (RNNs) and how they operate on sequences by sharing weights over time
understand the difference between vanilla RNNs and Long Short-Term Memory (LSTM) RNNs
understand how to sample from an RNN at test-time
understand how to combine convolutional neural nets and recurrent nets to implement an image captioning system
understand how a trained convolutional network can be used to compute gradients with respect to the input image
implement different applications of image gradients, including saliency maps, fooling images, class visualizations, feature inversion, and DeepDream
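
To make the first goal concrete, here is a minimal NumPy sketch of a vanilla RNN forward pass. It is not the assignment's reference solution: the function names, argument order, and array shapes are assumptions chosen for illustration. The point it shows is that the same `Wx`, `Wh`, and `b` are applied at every timestep.

```python
import numpy as np

def rnn_step_forward(x, prev_h, Wx, Wh, b):
    """One vanilla RNN timestep: h_t = tanh(x_t Wx + h_{t-1} Wh + b)."""
    return np.tanh(x @ Wx + prev_h @ Wh + b)

def rnn_forward(x, h0, Wx, Wh, b):
    """Run the RNN over a full sequence.

    x:  (N, T, D) inputs, h0: (N, H) initial hidden state.
    Returns hidden states for every timestep, shape (N, T, H).
    """
    N, T, _ = x.shape
    H = h0.shape[1]
    h = np.zeros((N, T, H))
    prev_h = h0
    for t in range(T):
        # Weight sharing: Wx, Wh and b are identical at every timestep.
        prev_h = rnn_step_forward(x[:, t, :], prev_h, Wx, Wh, b)
        h[:, t, :] = prev_h
    return h

# Illustrative sizes and random weights (assumptions, not assignment data).
rng = np.random.default_rng(0)
N, T, D, H = 2, 5, 4, 3
x = rng.standard_normal((N, T, D))
h0 = np.zeros((N, H))
Wx = rng.standard_normal((D, H)) * 0.1
Wh = rng.standard_normal((H, H)) * 0.1
b = np.zeros(H)

h = rnn_forward(x, h0, Wx, Wh, b)
print(h.shape)
```

Because the parameters do not depend on `t`, the same function handles sequences of any length `T` without adding weights.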

Q1: Image Captioning with Vanilla RNNs (Completed)

The IPython notebook RNN_Captioning.ipynb will walk you through the implementation of an image captioning system on MS-COCO using vanilla recurrent networks.
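
At test time there is no ground-truth caption to feed in, so the model consumes its own previous prediction. The sketch below shows greedy decoding for a single example under stated assumptions: the parameter names, the token ids for the start and end markers, and the use of argmax (rather than drawing from the softmax distribution) are all illustrative choices, not taken from the notebook.

```python
import numpy as np

def sample_greedy(h0, W_embed, Wx, Wh, b, W_vocab, b_vocab,
                  start_token, end_token, max_length=10):
    """Greedy decoding for one example; returns a list of token ids."""
    h = h0
    token = start_token
    caption = []
    for _ in range(max_length):
        x = W_embed[token]                  # embed the previous word, shape (D,)
        h = np.tanh(x @ Wx + h @ Wh + b)    # vanilla RNN step, shape (H,)
        scores = h @ W_vocab + b_vocab      # scores over the vocabulary, shape (V,)
        token = int(np.argmax(scores))      # pick the highest-scoring word
        caption.append(token)
        if token == end_token:
            break
    return caption

# Hypothetical sizes and random weights, for illustration only.
rng = np.random.default_rng(0)
V, D, H = 8, 4, 6
W_embed = rng.standard_normal((V, D))
Wx = rng.standard_normal((D, H)) * 0.5
Wh = rng.standard_normal((H, H)) * 0.5
b = np.zeros(H)
W_vocab = rng.standard_normal((H, V))
b_vocab = np.zeros(V)
h0 = rng.standard_normal(H)  # in a captioning model this would come from image features

caption = sample_greedy(h0, W_embed, Wx, Wh, b, W_vocab, b_vocab,
                        start_token=1, end_token=2, max_length=10)
print(caption)
```

Replacing the argmax with a draw from the softmax of `scores` would give stochastic sampling instead of greedy decoding.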

Q2: Image Captioning with LSTMs (Completed)

The IPython notebook LSTM_Captioning.ipynb will walk you through the implementation of Long Short-Term Memory (LSTM) RNNs and their application to image captioning on MS-COCO.
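
For comparison with a vanilla RNN step, a single LSTM timestep can be sketched as follows. This is an illustrative NumPy version, not the notebook's code: the function name, the packing of all four gates into one `(D, 4H)` weight matrix, and the gate order (input, forget, output, candidate) are assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step_forward(x, prev_h, prev_c, Wx, Wh, b):
    """One LSTM timestep.

    x: (N, D), prev_h and prev_c: (N, H), Wx: (D, 4H), Wh: (H, 4H), b: (4H,).
    Returns (next_h, next_c), each of shape (N, H).
    """
    H = prev_h.shape[1]
    a = x @ Wx + prev_h @ Wh + b        # all gate pre-activations, shape (N, 4H)
    i = sigmoid(a[:, 0 * H:1 * H])      # input gate
    f = sigmoid(a[:, 1 * H:2 * H])      # forget gate
    o = sigmoid(a[:, 2 * H:3 * H])      # output gate
    g = np.tanh(a[:, 3 * H:4 * H])      # candidate cell update
    next_c = f * prev_c + i * g         # additive cell-state update
    next_h = o * np.tanh(next_c)
    return next_h, next_c

# Illustrative sizes and random weights (assumptions).
rng = np.random.default_rng(0)
N, D, H = 2, 4, 3
x = rng.standard_normal((N, D))
prev_h = rng.standard_normal((N, H))
prev_c = rng.standard_normal((N, H))
Wx = rng.standard_normal((D, 4 * H)) * 0.1
Wh = rng.standard_normal((H, 4 * H)) * 0.1
b = np.zeros(4 * H)

next_h, next_c = lstm_step_forward(x, prev_h, prev_c, Wx, Wh, b)
print(next_h.shape, next_c.shape)
```

The key difference from the vanilla step is the separate cell state `c`, which is updated additively and gated rather than passed through a single `tanh`.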

Q3: Image Gradients: Saliency maps and Fooling Images (Not Yet)

The IPython notebook ImageGradients.ipynb will introduce the TinyImageNet dataset. You will use a pretrained model on this dataset to compute gradients with respect to the image, and use them to produce saliency maps and fooling images.
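
As a rough illustration of a saliency map, the sketch below differentiates one class score with respect to the input pixels and takes the maximum absolute gradient over channels. A tiny two-layer network with random weights stands in for the pretrained TinyImageNet model; that network, its sizes, and all function names are assumptions made so the example runs on its own.

```python
import numpy as np

rng = np.random.default_rng(0)
C, IMG_H, IMG_W, HIDDEN, NUM_CLASSES = 3, 8, 8, 16, 10
D = C * IMG_H * IMG_W

# Toy stand-in for a pretrained model (hypothetical random weights).
W1 = rng.standard_normal((HIDDEN, D)) * 0.1
b1 = np.zeros(HIDDEN)
W2 = rng.standard_normal((NUM_CLASSES, HIDDEN))
b2 = np.zeros(NUM_CLASSES)

def class_scores(image):
    """Unnormalized class scores for one image of shape (C, H, W)."""
    h = np.tanh(W1 @ image.ravel() + b1)
    return W2 @ h + b2

def score_gradient(image, label):
    """Gradient of the score for `label` with respect to the image pixels."""
    h = np.tanh(W1 @ image.ravel() + b1)
    dh = W2[label] * (1.0 - h ** 2)     # backprop through tanh
    return (W1.T @ dh).reshape(image.shape)

def saliency_map(image, label):
    """Max absolute gradient over the channel axis, shape (H, W)."""
    return np.abs(score_gradient(image, label)).max(axis=0)

image = rng.standard_normal((C, IMG_H, IMG_W))
label = 3
sal = saliency_map(image, label)
print(sal.shape)
```

A fooling image uses the same gradient in the other direction: instead of only visualizing it, the image is repeatedly nudged along the gradient of a chosen target class score until the model's prediction changes.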

Q4: Image Generation: Classes, Inversion, DeepDream (Not Yet)

In the IPython notebook ImageGeneration.ipynb you will use the pretrained TinyImageNet model to generate images. In particular you will generate class visualizations and implement feature inversion and DeepDream.
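
A class visualization can be sketched as gradient ascent on the image itself: start from near-zero noise and repeatedly step in the direction that raises one class score, with an L2 penalty to keep pixel values small. As before, a small random two-layer network replaces the pretrained TinyImageNet model, and the function names, learning rate, regularization strength, and iteration count are illustrative assumptions rather than the notebook's settings.

```python
import numpy as np

rng = np.random.default_rng(0)
C, IMG_H, IMG_W, HIDDEN, NUM_CLASSES = 3, 8, 8, 16, 10

# Toy stand-in for a pretrained model (hypothetical random weights).
W1 = rng.standard_normal((HIDDEN, C * IMG_H * IMG_W)) * 0.1
W2 = rng.standard_normal((NUM_CLASSES, HIDDEN)) * 0.5

def class_score(image, label):
    """Score of one class for an image of shape (C, H, W)."""
    return float(W2[label] @ np.tanh(W1 @ image.ravel()))

def score_gradient(image, label):
    """Gradient of that class score with respect to the image pixels."""
    h = np.tanh(W1 @ image.ravel())
    return (W1.T @ (W2[label] * (1.0 - h ** 2))).reshape(image.shape)

def class_visualization(label, l2_reg=1e-3, learning_rate=0.02, num_iters=200):
    """Gradient ascent on score(label) - l2_reg * ||image||^2."""
    image = rng.standard_normal((C, IMG_H, IMG_W)) * 0.01
    for _ in range(num_iters):
        grad = score_gradient(image, label) - 2.0 * l2_reg * image
        image += learning_rate * grad   # ascend: move the image, not the weights
    return image

vis = class_visualization(label=5)
print(vis.shape, class_score(vis, 5))
```

Feature inversion and DeepDream follow the same pattern of optimizing the image with the network weights held fixed; only the objective being ascended changes.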
