Picture for Colin Cai

Colin Cai

Nexus: Taming Throughput-Latency Tradeoff in LLM Serving via Efficient GPU Sharing

Add code
Jul 09, 2025
Viaarxiv icon

Autellix: An Efficient Serving Engine for LLM Agents as General Programs

Add code
Feb 19, 2025
Viaarxiv icon