The Synergy of Speculative Decoding and Batching in Serving Large Language Models

Oct 28, 2023
Qidong Su, Christina Giannoula, Gennady Pekhimenko
