Alert button

Towards Fast Inference: Exploring and Improving Blockwise Parallel Drafts

Apr 14, 2024
Taehyeon Kim, Ananda Theertha Suresh, Kishore Papineni, Michael Riley, Sanjiv Kumar, Adrian Benton

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: