Alert button

ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding

Feb 21, 2024
Shuzhang Zhong, Zebin Yang, Meng Li, Ruihao Gong, Runsheng Wang, Ru Huang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: