Carl Chengyan Fu

Efficient Speculative Decoding for Llama at Scale: Challenges and Solutions

Aug 11, 2025
Via arXiv