Wenkang Wu

Staggered Batch Scheduling: Co-optimizing Time-to-First-Token and Throughput for High-Efficiency LLM Inference

Dec 18, 2025
Via arXiv