Alert button

Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding

Feb 19, 2024
Zhuoming Chen, Avner May, Ruslan Svirschevski, Yuhsun Huang, Max Ryabinin, Zhihao Jia, Beidi Chen

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: