Yuan Lyu

Theoretically Optimal Attention/FFN Ratios in Disaggregated LLM Serving

Jan 29, 2026
Via arXiv