Dacheng Tao

Revisiting Knowledge Distillation for Autoregressive Language Models

Feb 19, 2024
Qihuang Zhong, Liang Ding, Li Shen, Juhua Liu, Bo Du, Dacheng Tao

Via arXiv

ROSE Doesn't Do That: Boosting the Safety of Instruction-Tuned Large Language Models with Reverse Prompt Contrastive Decoding

Feb 19, 2024
Qihuang Zhong, Liang Ding, Juhua Liu, Bo Du, Dacheng Tao

Via arXiv

Communication-Efficient Distributed Learning with Local Immediate Error Compensation

Feb 19, 2024
Yifei Cheng, Li Shen, Linli Xu, Xun Qian, Shiwei Wu, Yiming Zhou, Tie Zhang, Dacheng Tao, Enhong Chen

Via arXiv

Towards Theoretical Understandings of Self-Consuming Generative Models

Feb 19, 2024
Shi Fu, Sen Zhang, Yingjie Wang, Xinmei Tian, Dacheng Tao

Via arXiv

Continual Learning on Graphs: Challenges, Solutions, and Opportunities

Feb 18, 2024
Xikun Zhang, Dongjin Song, Dacheng Tao

Via arXiv

Mitigating Reward Hacking via Information-Theoretic Reward Modeling

Feb 16, 2024
Yuchun Miao, Sen Zhang, Liang Ding, Rong Bao, Lefei Zhang, Dacheng Tao

Via arXiv

Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases

Feb 13, 2024
Ziyi Zhang, Sen Zhang, Yibing Zhan, Yong Luo, Yonggang Wen, Dacheng Tao

Via arXiv

Large Language Models as an Indirect Reasoner: Contrapositive and Contradiction for Automated Reasoning

Feb 06, 2024
Yanfang Zhang, Yiliu Sun, Yibing Zhan, Dapeng Tao, Dacheng Tao, Chen Gong

Via arXiv

A Survey on Transformer Compression

Feb 05, 2024
Yehui Tang, Yunhe Wang, Jianyuan Guo, Zhijun Tu, Kai Han, Hailin Hu, Dacheng Tao

Via arXiv

Representation Surgery for Multi-Task Model Merging

Feb 05, 2024
Enneng Yang, Li Shen, Zhenyi Wang, Guibing Guo, Xiaojun Chen, Xingwei Wang, Dacheng Tao

Via arXiv