Knowledge distillation with Keras

Keras implementation of Hinton's knowledge distillation (KD), a way of transferring knowledge from a large model into a smaller model.
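The objective from [1] can be illustrated with a minimal NumPy sketch. It is not the code from the notebooks in this repository: the function names, the `temperature` value, and the `soft_weight` mixing coefficient are assumptions chosen for illustration. The student is trained on a weighted sum of the usual cross-entropy with the hard labels and a cross-entropy with the teacher's temperature-softened predictions.

```python
import numpy as np


def softmax(logits, temperature=1.0):
    """Row-wise softmax of logits divided by a temperature."""
    z = np.asarray(logits, dtype=np.float64) / temperature
    z = z - z.max(axis=1, keepdims=True)  # subtract the row maximum for numerical stability
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def cross_entropy(targets, probs, eps=1e-12):
    """Mean cross-entropy between target distributions and predicted probabilities."""
    return float(-np.mean(np.sum(targets * np.log(probs + eps), axis=1)))


def distillation_loss(hard_targets, teacher_logits, student_logits,
                      temperature=5.0, soft_weight=0.5):
    """Weighted sum of the hard-label loss and the soft-target loss.

    temperature and soft_weight are illustrative values, not the ones
    used in this repository.
    """
    # Ordinary cross-entropy with the true one-hot labels (temperature 1).
    hard_term = cross_entropy(hard_targets, softmax(student_logits))
    # Cross-entropy between softened teacher and softened student predictions.
    soft_term = cross_entropy(softmax(teacher_logits, temperature),
                              softmax(student_logits, temperature))
    # The T^2 factor keeps the soft-term gradients on a comparable scale, as suggested in [1].
    return (1.0 - soft_weight) * hard_term + soft_weight * temperature ** 2 * soft_term
```

A higher temperature flattens the teacher's distribution, so the student also sees how the teacher ranks the wrong classes.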

Summary

I use the Caltech-256 dataset to demonstrate the technique.
I transfer knowledge from Xception (the teacher) to MobileNet-0.25 and SqueezeNet v1.1 (the students).
Results:

| model | accuracy, % | top 5 accuracy, % | logloss |
|---|---|---|---|
| Xception | 82.3 | 94.7 | 0.705 |
| MobileNet-0.25 | 64.6 | 85.9 | 1.455 |
| MobileNet-0.25 with KD | 66.2 | 86.7 | 1.464 |
| SqueezeNet v1.1 | 67.2 | 86.5 | 1.555 |
| SqueezeNet v1.1 with KD | 68.9 | 87.4 | 1.297 |
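For reference, the three reported metrics are usually computed from predicted class probabilities as in the sketch below. This is an illustration, not the evaluation code from the notebooks; the function names are hypothetical, and the natural logarithm in the logloss is an assumption.

```python
import numpy as np


def top_k_accuracy(probs, labels, k=1):
    """Percentage of samples whose true label is among the k most probable classes."""
    top_k = np.argsort(probs, axis=1)[:, -k:]        # indices of the k largest probabilities
    hits = (top_k == labels[:, None]).any(axis=1)    # True where the true label is in the top k
    return 100.0 * hits.mean()


def log_loss(probs, labels, eps=1e-12):
    """Mean negative log-probability assigned to the true class."""
    picked = probs[np.arange(len(labels)), labels]   # probability of the true class per sample
    return float(-np.mean(np.log(np.clip(picked, eps, 1.0))))
```

Here `probs` is an array of shape (samples, classes) and `labels` holds integer class indices.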

Implementation details

I use models pretrained on ImageNet.
For validation I use 20 images from each category.
For training I use 100 images from each category.
I use random crops and color augmentation to balance the dataset.
I resize all images to 299x299.
In all models I train the last two layers.
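The per-category split and the random crops could look roughly like the sketch below. It is not the code from `train_val_split`: the function names are hypothetical, and reusing images when a category has fewer than 100 left after the validation split is an assumption about how the balancing works (augmentation would then make the repeated images differ).

```python
import random

import numpy as np


def split_category(file_names, n_val=20, n_train=100, seed=0):
    """Split one category into n_train training and n_val validation images."""
    if len(file_names) <= n_val:
        raise ValueError("category has too few images for the validation split")
    rng = random.Random(seed)
    shuffled = list(file_names)
    rng.shuffle(shuffled)
    val, rest = shuffled[:n_val], shuffled[n_val:]
    if len(rest) >= n_train:
        train = rest[:n_train]
    else:
        # Assumption: small categories are filled up by reusing images.
        train = rest + [rng.choice(rest) for _ in range(n_train - len(rest))]
    return train, val


def random_crop(image, crop_size, rng):
    """Cut a random crop_size x crop_size patch from an (H, W, C) array.

    Assumes crop_size does not exceed the image height or width.
    """
    height, width = image.shape[:2]
    top = rng.randint(0, height - crop_size)    # randint is inclusive on both ends
    left = rng.randint(0, width - crop_size)
    return image[top:top + crop_size, left:left + crop_size]
```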

Notes on `flow_from_directory`

I use three slightly different versions of Keras' ImageDataGenerator.flow_from_directory:

the original version, for the initial training of Xception and MobileNet.
ver1, for getting logits from Xception. Here DirectoryIterator.next also outputs image names.
ver2, for knowledge transfer. Here DirectoryIterator.next packs logits with hard true targets.

All three versions differ only in the DirectoryIterator.next function.
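One way to pack logits with hard targets is to concatenate them along the class axis, so that a single target array carries both and the loss function can split it again. The sketch below assumes this layout; `pack_batch`, `unpack_batch`, and the `logits_by_name` dictionary (teacher logits keyed by image name, as could be collected with ver1) are hypothetical names, not the repository's API.

```python
import numpy as np


def pack_batch(image_names, labels, logits_by_name, n_classes):
    """Build targets of shape (batch, 2 * n_classes): one-hot labels, then teacher logits."""
    one_hot = np.eye(n_classes, dtype=np.float32)[np.asarray(labels)]
    # Look up the stored teacher logits for every image in the batch.
    logits = np.stack([logits_by_name[name] for name in image_names]).astype(np.float32)
    return np.concatenate([one_hot, logits], axis=1)


def unpack_batch(packed, n_classes):
    """Split packed targets back into (one-hot labels, teacher logits)."""
    return packed[:, :n_classes], packed[:, n_classes:]
```

Keying the logits by image name lets the teacher run once over the dataset, with its outputs reused in every epoch of student training.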

Requirements

Python 3.5
Keras 2.0.6
torchvision, Pillow
numpy, pandas, tqdm

References

[1] Geoffrey Hinton, Oriol Vinyals, Jeff Dean, Distilling the Knowledge in a Neural Network

