Picture for Yueming Chen

Yueming Chen

Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding

Add code
Apr 10, 2024
Figure 1 for Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
Figure 2 for Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
Figure 3 for Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
Figure 4 for Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding
Viaarxiv icon