Alert button
Picture for Yunfei Cheng

Yunfei Cheng

Alert button

Recurrent Drafter for Fast Speculative Decoding in Large Language Models

Add code
Bookmark button
Alert button
Mar 22, 2024
Aonan Zhang, Chong Wang, Yi Wang, Xuanyu Zhang, Yunfei Cheng

Figure 1 for Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Figure 2 for Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Figure 3 for Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Figure 4 for Recurrent Drafter for Fast Speculative Decoding in Large Language Models
Viaarxiv icon