Alert button

Parallel Decoding via Hidden Transfer for Lossless Large Language Model Acceleration

Apr 18, 2024
Pengfei Wu, Jiahao Liu, Zhuocheng Gong, Qifan Wang, Jinpeng Li, Jingang Wang, Xunliang Cai, Dongyan Zhao

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: