Picture for Chris Tong

Chris Tong

Speculative Decoding in Decentralized LLM Inference: Turning Communication Latency into Computation Throughput

Add code
Nov 13, 2025
Viaarxiv icon