Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Highlight Every Step: Knowledge Distillation via Collaborative Teaching

Jul 23, 2019

Haoran Zhao, Xin Sun, Junyu Dong, Changrui Chen, Zihe Dong

Figure 1 for Highlight Every Step: Knowledge Distillation via Collaborative Teaching

Figure 2 for Highlight Every Step: Knowledge Distillation via Collaborative Teaching

Figure 3 for Highlight Every Step: Knowledge Distillation via Collaborative Teaching

Figure 4 for Highlight Every Step: Knowledge Distillation via Collaborative Teaching

Share this with someone who'll enjoy it:

Abstract:High storage and computational costs obstruct deep neural networks to be deployed on resource-constrained devices. Knowledge distillation aims to train a compact student network by transferring knowledge from a larger pre-trained teacher model. However, most existing methods on knowledge distillation ignore the valuable information among training process associated with training results. In this paper, we provide a new Collaborative Teaching Knowledge Distillation (CTKD) strategy which employs two special teachers. Specifically, one teacher trained from scratch (i.e., scratch teacher) assists the student step by step using its temporary outputs. It forces the student to approach the optimal path towards the final logits with high accuracy. The other pre-trained teacher (i.e., expert teacher) guides the student to focus on a critical region which is more useful for the task. The combination of the knowledge from two special teachers can significantly improve the performance of the student network in knowledge distillation. The results of experiments on CIFAR-10, CIFAR-100, SVHN and Tiny ImageNet datasets verify that the proposed knowledge distillation method is efficient and achieves state-of-the-art performance.

View paper on

Share this with someone who'll enjoy it:

Title:Highlight Every Step: Knowledge Distillation via Collaborative Teaching

Paper and Code