Alert button

Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding

Add code
Bookmark button
Alert button
Sep 15, 2023
Jun Zhang, Jue Wang, Huan Li, Lidan Shou, Ke Chen, Gang Chen, Sharad Mehrotra

Figure 1 for Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding
Figure 2 for Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding
Figure 3 for Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding
Figure 4 for Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: