Our code builds on mdistiller (https://github.com/megvii-research/mdistiller.git) for knowledge distillation and on Transformer-Explainability (https://github.com/hila-chefer/Transformer-Explainability.git) for the explainability tests.
On CIFAR-100 (top-1 accuracy, %):

| Teacher <br> Student | ResNet56 <br> ResNet20 | ResNet110 <br> ResNet32 | ResNet32x4 <br> ResNet8x4 | WRN-40-2 <br> WRN-16-2 | WRN-40-2 <br> WRN-40-1 | VGG13 <br> VGG8 |
|---|---|---|---|---|---|---|
| KD | 70.66 | 73.08 | 73.33 | 74.92 | 73.54 | 72.98 |
| Exp-KD | 71.77 | 74.13 | 77.36 | 76.27 | 74.77 | 74.85 |

| Teacher <br> Student | ResNet32x4 <br> ShuffleNet-V1 | WRN-40-2 <br> ShuffleNet-V1 | VGG13 <br> MobileNet-V2 | ResNet50 <br> MobileNet-V2 | ResNet32x4 <br> MobileNet-V2 |
|---|---|---|---|---|---|
| KD | 74.07 | 74.83 | 67.37 | 67.35 | 74.45 |
| Exp-KD | 78.20 | 77.75 | 70.63 | 71.74 | 78.98 |
On ImageNet (top-1 accuracy, %):

| Teacher <br> Student | ResNet34 <br> ResNet18 | ResNet50 <br> MobileNet-V1 |
|---|---|---|
| KD | 71.03 | 70.50 |
| Exp-KD | 71.74 | 72.43 |
Environments:
- Python 3.6
- PyTorch 1.9.0
- torchvision 0.10.0
Install the package:

```bash
sudo pip3 install -r requirements.txt
sudo python3 setup.py develop
```
- Wandb is used as the logger.
  - Registration: https://wandb.ai/home.
  - If you don't want wandb as your logger, set `CFG.LOG.WANDB` to `False` in `mdistiller/engine/cfg.py`.
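  For reference, disabling wandb is a one-line change in the config file. The snippet below is only a sketch of that edit; the surrounding definitions in `mdistiller/engine/cfg.py` are omitted and may differ slightly.

  ```python
  # mdistiller/engine/cfg.py (sketch of the single edited line; context omitted)
  CFG.LOG.WANDB = False  # disable wandb logging
  ```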
- Evaluation
  - You can evaluate the performance of models trained by yourself.
  - If you test models on ImageNet, please download the dataset from https://image-net.org/ and put it in `./data/imagenet`.

  ```bash
  # evaluate teachers
  python3 tools/eval.py -m resnet32x4  # resnet32x4 on cifar100
  python3 tools/eval.py -m ResNet34 -d imagenet  # ResNet34 on imagenet
  # evaluate students
  python3 tools/eval.py -m model_name -c output/your_exp/student_best  # your checkpoints
  ```
- Training on CIFAR-100
  - Download `cifar_teachers.tar` from https://github.com/megvii-research/mdistiller/releases/tag/checkpoints and untar it to `./download_ckpts` via `tar xvf cifar_teachers.tar`.

  ```bash
  # for instance, our Exp-KD method.
  python3 tools/train.py --cfg configs/cifar100/cam/res32x4_res8x4.yaml

  # you can also change settings at the command line
  python3 tools/train.py --cfg configs/cifar100/cam/res32x4_res8x4.yaml SOLVER.BATCH_SIZE 128 SOLVER.LR 0.1
  ```
- Training on ImageNet
  - Download the dataset from https://image-net.org/ and put it in `./data/imagenet`.

  ```bash
  # for instance, our Exp-KD method.
  python3 tools/train.py --cfg configs/imagenet/r34_r18/cam.yaml
  ```
- Training on MS-COCO
  - See `detection.md`.
- Extension: Visualizations
  - Jupyter notebooks: `tsne` and `correlation_matrices`.
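  As a rough, hedged illustration of the kind of plot the `tsne` notebook produces, the sketch below embeds student features from CIFAR-100 with scikit-learn. It is not the notebook's actual code: the assumption that the model's forward pass returns `(logits, feature_dict)` with a `"pooled_feat"` entry follows mdistiller-style models and may need adapting, and `student` / `val_loader` must be built with the repo's own utilities.

  ```python
  # Hedged sketch: t-SNE of student penultimate features on CIFAR-100
  # (illustrative only; the repo's tsne notebook may differ).
  import torch
  import matplotlib.pyplot as plt
  from sklearn.manifold import TSNE


  def collect_features(model, loader, device="cuda"):
      """Collect penultimate-layer features and labels over a data loader.

      Assumes the forward pass returns (logits, feature_dict) with a
      "pooled_feat" entry, as in mdistiller-style models.
      """
      feats, labels = [], []
      model.eval().to(device)
      with torch.no_grad():
          for images, targets in loader:
              _, feat_dict = model(images.to(device))
              feats.append(feat_dict["pooled_feat"].cpu())
              labels.append(targets)
      return torch.cat(feats).numpy(), torch.cat(labels).numpy()


  # Example usage (student and val_loader come from the repo's own builders):
  # feats, labels = collect_features(student, val_loader)
  # emb = TSNE(n_components=2, init="pca", perplexity=30).fit_transform(feats)
  # plt.scatter(emb[:, 0], emb[:, 1], c=labels, s=3, cmap="tab20")
  # plt.savefig("tsne.png", dpi=200)
  ```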
MDistiller is released under the MIT license. See LICENSE for details.