Picture for Yuhsun Huang

Yuhsun Huang

Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding

Add code
Feb 29, 2024
Figure 1 for Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
Figure 2 for Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
Figure 3 for Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
Figure 4 for Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding
Viaarxiv icon