exploring-mnist-mix

Several explorations and experiments centered around the MNIST-MIX (arXiv, GitHub) dataset, focusing mainly on comparing feedforward and convolutional networks and examining the effects of various data manipulations on learning.

Final project for CS 445 / 545 at Portland State University, Spring 2020.

Created and maintained by:

Using `main.py`

Example -- create a new feedforward network, train and save it.

python main.py --create feedforward 100 100 100 \
               --labels specific \
               --train data/splits/all_train.split 10 16 0.01 0.9 \
               --test data/splits/all_test.split tests/10_3x100.txt \
               --log logs/10_3x100.log \
               --save models/10_3x100.pt \
               --gpu
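
For a rough sense of the model this command builds, the sketch below counts the trainable parameters of a fully connected network with three hidden layers of 100 units. It assumes flattened 28x28 grayscale inputs (784 features), one bias per unit, and an output layer with one unit per class. None of these details are confirmed by this README, and `count_parameters` is a hypothetical helper, not a function from this repository.

```python
def count_parameters(hidden_sizes, num_classes, input_size=28 * 28):
    """Count weights and biases in a plain fully connected network.

    Assumptions (not confirmed by the repository): dense layers with
    biases, flattened 28x28 inputs, and one output unit per class.
    """
    sizes = [input_size, *hidden_sizes, num_classes]
    total = 0
    for fan_in, fan_out in zip(sizes, sizes[1:]):
        total += fan_in * fan_out + fan_out  # weights plus biases
    return total


if __name__ == "__main__":
    hidden = [100, 100, 100]  # from `--create feedforward 100 100 100`
    print(count_parameters(hidden, num_classes=100))  # --labels specific
    print(count_parameters(hidden, num_classes=10))   # --labels agnostic
```

Under these assumptions, switching from specific to agnostic labels only changes the size of the final layer.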

Example -- load a ResNet model and test it on English.

python main.py --load models/10_resnet14.pt \
               --labels specific \
               --test data/splits/3_english_test.split tests/10_resnet14_english.txt \
               --gpu

--create [type] [parameters] -- Use to build and train a new model. The type is either feedforward or resnet, and the parameters are a sequence of numbers defining the model. For example, to create a feedforward model with three layers of 100 neurons each, use --create feedforward 100 100 100. Alternatively, to create an 8-layer ResNet, use --create resnet 1 1 1. (ResNet layers are calculated as 2 * sum(blocks) + 2. In this implementation, there are three block sizes to specify.)
--load [path] -- Use to load a PyTorch model already saved to disk. This cannot be used simultaneously with the --create flag.
--labels [specific/agnostic] -- Use this to specify whether image labels should be "specific" (100 classes) or "agnostic" (10 classes) to the language.
--train [path] [epochs] [batch size] [learning rate] [momentum] -- Use this to train a model. The path is to the split file, which you should create in the data folder.
--test [data path] [save path] -- Use this to test a model at the end of the script. The first path is to the split file to use. The second path is optional; if given, the confusion matrix and accuracy are saved to that file. If --train is used, --test must be used as well; however, --test can be used on its own.
--log [path] -- Use this to save a file containing the training loss, validation loss, and accuracy at each epoch of training.
--save [path] -- Use this to save the PyTorch model to the specified path at the end of the script.
--gpu -- A binary flag; include it to train with CUDA.
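
Two of the options above come down to small calculations: the ResNet depth formula and the size of the label space. The sketch below works through both. `resnet_depth` applies the stated formula `2 * sum(blocks) + 2`. `to_agnostic` assumes that a language-specific label is encoded as `10 * language_id + digit`, which this README does not confirm. Both function names are hypothetical and do not come from `main.py`.

```python
def resnet_depth(blocks):
    """Layer count from the README formula: 2 * sum(blocks) + 2."""
    if len(blocks) != 3:
        raise ValueError("expected three block sizes")
    return 2 * sum(blocks) + 2


def to_agnostic(specific_label):
    """Map a 100-class label to a 10-class digit label.

    Assumption: specific labels are encoded as 10 * language_id + digit.
    """
    if not 0 <= specific_label < 100:
        raise ValueError("specific labels must be in the range 0..99")
    return specific_label % 10


if __name__ == "__main__":
    print(resnet_depth([1, 1, 1]))  # depth for `--create resnet 1 1 1`
    print(resnet_depth([2, 2, 2]))  # a deeper configuration
    print(to_agnostic(37))          # digit under the assumed encoding
```

With block sizes `1 1 1` the formula gives 8 layers, matching the example in the `--create` description.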

Data summary

ID	Language	Train	Test	Total
0	Arabic	62400	10600	73000
1	Bangla	39990	9150	49140
2	Devanagari	2400	600	3000
3	English	240000	40000	280000
4	Farsi	60000	20000	80000
5	Kannada	60000	20240	80240
6	Swedish	6600	1000	7600
7	Telugu	2400	600	3000
8	Tibetan	14214	3554	17768
9	Urdu	6606	1414	8020
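
The table shows a strong imbalance between languages. As a quick check, the following sketch recomputes the overall totals and each language's share of the training data. The counts are copied from the table; the names `COUNTS` and `summarize` are illustrative and not part of the repository.

```python
# (train, test) image counts copied from the data summary table
COUNTS = {
    "Arabic": (62400, 10600),
    "Bangla": (39990, 9150),
    "Devanagari": (2400, 600),
    "English": (240000, 40000),
    "Farsi": (60000, 20000),
    "Kannada": (60000, 20240),
    "Swedish": (6600, 1000),
    "Telugu": (2400, 600),
    "Tibetan": (14214, 3554),
    "Urdu": (6606, 1414),
}


def summarize(counts):
    """Return total train size, total test size, and per-language train shares."""
    total_train = sum(train for train, _ in counts.values())
    total_test = sum(test for _, test in counts.values())
    shares = {lang: train / total_train for lang, (train, _) in counts.items()}
    return total_train, total_test, shares


if __name__ == "__main__":
    total_train, total_test, shares = summarize(COUNTS)
    print(f"train={total_train} test={total_test}")
    # List languages from largest to smallest share of the training set
    for lang, share in sorted(shares.items(), key=lambda kv: -kv[1]):
        print(f"{lang:<11}{share:6.1%}")
```

English alone accounts for a little under half of the training images, while Devanagari and Telugu each contribute less than 1%.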
