Cascade Speculative Drafting for Even Faster LLM Inference

Dec 21, 2023
Ziyi Chen, Xiaocong Yang, Jiacheng Lin, Chenkai Sun, Jie Huang, Kevin Chen-Chuan Chang
