Communication-Efficient Collaborative LLM Inference via Distributed Speculative Decoding

Add code
Sep 04, 2025
Figure 1 for Communication-Efficient Collaborative LLM Inference via Distributed Speculative Decoding
Figure 2 for Communication-Efficient Collaborative LLM Inference via Distributed Speculative Decoding
Figure 3 for Communication-Efficient Collaborative LLM Inference via Distributed Speculative Decoding
Figure 4 for Communication-Efficient Collaborative LLM Inference via Distributed Speculative Decoding

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: