Alert button

Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models

Apr 03, 2024
Taiqiang Wu, Chaofan Tao, Jiahao Wang, Zhe Zhao, Ngai Wong

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: