This repository contains the source code for the experiments in *LCA: Loss Change Allocation for Neural Network Training* (to be presented at NeurIPS 2019) by Janice Lan, Rosanne Liu, Hattie Zhou, and Jason Yosinski. If you use this code, please cite:
```
@inproceedings{lan-2019-loss-change-allocation,
  title={LCA: Loss Change Allocation for Neural Network Training},
  author={Janice Lan and Rosanne Liu and Hattie Zhou and Jason Yosinski},
  booktitle={Advances in Neural Information Processing Systems},
  year={2019}
}
```
To read more about this project, see the Uber Eng blog post.
Main files:

- `data/download_mnist.py` and `data/download_cifar10.py`: download MNIST/CIFAR10 data, split it into train, val, and test sets, and save them in the `data` folder as h5 files (a loading sketch follows this list)
- `train.py`: trains a model and saves weights at every iteration, using architectures defined in `network_builders.py`
- `adaptive_calc_gradients.py`: calculates the gradients used for LCA, based on the saved weights
- `save_lca_stream.py`: calculates LCA for runs where the gradients or weights don't fit into memory
- `plot_util.py`: demonstrates some examples of various visualizations, aggregations, and analyses
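The download scripts produce plain HDF5 splits (e.g. `data/cifar10_train.h5`, used in the commands below). A minimal loading sketch, assuming only that these are standard h5 files; the dataset names in the commented lines are guesses, so check the printed keys first:

```python
import h5py

# Open one of the generated splits and list what the download script wrote.
with h5py.File('data/cifar10_train.h5', 'r') as f:
    print(list(f.keys()))   # inspect the actual dataset names first
    # Hypothetical reads -- substitute the real keys printed above:
    # x = f['images'][:]    # e.g. an (N, 32, 32, 3) image array
    # y = f['labels'][:]    # e.g. an (N,) integer label array
```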
(Optional) This codebase uses the GitResultsManager
package to keep track of experiments. See: https://github.com/yosinski/GitResultsManager
- To train (see the weight-file sketch after these commands):

```
python train.py --train_h5 data/cifar10_train.h5 --test_h5 data/cifar10_val.h5 --input_dim 32,32,3 --arch resnet --opt sgd --lr 0.1 --save_weights --num_epochs 25 --large_batch_size 5000 --test_batch_size 5000 --eval_every 100 --print_every 100 --log_every 50 --output_dir outputs
```
- To calculate gradients:

```
python adaptive_calc_gradients.py --train_h5 data/cifar10_train.h5 --test_h5 data/cifar10_val.h5 --input_dim 32,32,3 --arch resnet --opt sgd --large_batch_size 2500 --test_batch_size 2500 --num_gpus 4 --weights_h5 outputs/weights
```
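The training command saves weights at every iteration under `--output_dir`, and the gradient command reads them back via `--weights_h5 outputs/weights`. The internal layout of that file is whatever `train.py` wrote, so a generic walk over its groups and datasets is a safe way to inspect it (nothing below assumes any particular dataset names):

```python
import h5py

# Print every group/dataset in the saved-weights file along with its shape;
# the path matches --output_dir / --weights_h5 in the commands above.
with h5py.File('outputs/weights', 'r') as f:
    f.visititems(lambda name, obj: print(name, getattr(obj, 'shape', '')))
```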
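For reference, the quantity these scripts build up to: LCA decomposes the total change in training loss into per-parameter contributions by integrating the loss gradient along the path the optimizer traces through parameter space. A minimal first-order sketch of that decomposition, with assumed array shapes (`adaptive_calc_gradients.py` refines the integration adaptively with additional gradient evaluations):

```python
import numpy as np

def lca_first_order(weights, grads):
    """First-order LCA sketch.

    weights: (T + 1, P) parameter vectors, one per saved iteration
    grads:   (T, P) loss gradients, grads[t] evaluated at weights[t]
    returns: (T, P) per-iteration, per-parameter loss change allocation
    """
    deltas = np.diff(weights, axis=0)  # theta_{t+1} - theta_t at each step
    return grads * deltas              # negative entries helped reduce the loss

# Summing over axis 0 gives each parameter's total allocation over training;
# summing over axis 1 at step t approximates that iteration's loss change.
```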