Picture for Chris Tong

Chris Tong

ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning

Add code
Feb 03, 2026
Viaarxiv icon

Speculative Decoding in Decentralized LLM Inference: Turning Communication Latency into Computation Throughput

Add code
Nov 13, 2025
Figure 1 for Speculative Decoding in Decentralized LLM Inference: Turning Communication Latency into Computation Throughput
Figure 2 for Speculative Decoding in Decentralized LLM Inference: Turning Communication Latency into Computation Throughput
Figure 3 for Speculative Decoding in Decentralized LLM Inference: Turning Communication Latency into Computation Throughput
Viaarxiv icon