
Paper Re-implementation: Distilling the Knowledge in a Neural Network

This project implements and explores knowledge distillation techniques, as described in the seminal paper "Distilling the Knowledge in a Neural Network". The aim is to train a smaller, more efficient student model to mimic the behavior of a larger, more complex teacher model, preserving performance while reducing resource consumption.

Project Structure

funcs.py: Contains core helper functions for the project.

networks.py: Contains the teacher and student models implemented for the project.

teacher.ipynb: Jupyter Notebook for training the teacher model.

student.ipynb: Jupyter Notebook for training the student model using knowledge distillation.

How it Works

Knowledge Distillation

Knowledge distillation is a technique in which a teacher model trains a student model by transferring its learned knowledge. The student model learns not only from the true labels but also from the soft probabilities (soft targets) produced by the teacher model. This process is governed by a temperature parameter T and a weighting factor alpha that balances the contributions of the distillation loss and the true-label loss.
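
The exact loss used here is defined in funcs.py and student.ipynb; as a rough sketch of the idea only (assuming a PyTorch-style setup, with hypothetical names distillation_loss, student_logits, teacher_logits, T, and alpha), the combined objective looks roughly like this:

```python
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=4.0, alpha=0.7):
    """Hypothetical sketch: weighted sum of soft-target loss and hard-label loss."""
    # Soften both distributions with temperature T and compare them with KL divergence;
    # the T*T factor keeps gradient magnitudes comparable across temperatures (per the paper).
    soft_loss = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        F.softmax(teacher_logits / T, dim=1),
        reduction="batchmean",
    ) * (T * T)
    # Ordinary cross-entropy against the ground-truth labels.
    hard_loss = F.cross_entropy(student_logits, labels)
    # alpha balances the contribution of the two terms.
    return alpha * soft_loss + (1.0 - alpha) * hard_loss
```

In practice the teacher's logits would typically be computed under torch.no_grad(), so that only the student's parameters are updated during distillation.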

Results and Observations

The temperature parameter (T) significantly influenced the student model's performance: higher temperatures produced smoother probability distributions, which aided knowledge transfer. The student model achieved higher accuracy after distillation than the same model trained without it.
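
As a small illustration of that softening effect (not code from this repository), dividing the logits by a larger T before the softmax spreads probability mass more evenly across classes:

```python
import torch
import torch.nn.functional as F

logits = torch.tensor([4.0, 1.0, 0.2])
print(F.softmax(logits, dim=0))        # T = 1: most of the mass sits on the first class
print(F.softmax(logits / 5.0, dim=0))  # T = 5: a visibly smoother distribution
```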

Acknowledgments

This project is inspired by the paper "Distilling the Knowledge in a Neural Network" by Geoffrey Hinton, Oriol Vinyals, and Jeff Dean.

Link to paper: https://arxiv.org/pdf/1503.02531

For questions or contributions, feel free to contact me or submit a pull request!
