Accelerating Retrieval-Augmented Language Model Serving with Speculation

Jan 25, 2024
Zhihao Zhang, Alan Zhu, Lijie Yang, Yihua Xu, Lanting Li, Phitchaya Mangpo Phothilimthana, Zhihao Jia

View paper on arXiv
