Jiakun Fan

AgentCgroup: Understanding and Controlling OS Resources of AI Agents

Feb 10, 2026
Via arXiv

WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching

Jan 15, 2026
Via arXiv

SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving

Jun 11, 2025
Via arXiv