Picture for Meixuan Wang

Meixuan Wang

Theoretically Optimal Attention/FFN Ratios in Disaggregated LLM Serving

Add code
Jan 29, 2026
Viaarxiv icon

LLM Serving Optimization with Variable Prefill and Decode Lengths

Add code
Aug 08, 2025
Viaarxiv icon