Picture for Max

Max

Speculative Decoding in Decentralized LLM Inference: Turning Communication Latency into Computation Throughput

Add code
Nov 13, 2025
Viaarxiv icon