Picture for Raja Gond

Raja Gond

LLM-42: Enabling Determinism in LLM Inference with Verified Speculation

Add code
Jan 25, 2026
Viaarxiv icon

TokenWeave: Efficient Compute-Communication Overlap for Distributed LLM Inference

Add code
May 16, 2025
Viaarxiv icon