Picture for Yuezhi Che

Yuezhi Che

Enabling Disaggregated Multi-Stage MLLM Inference via GPU-Internal Scheduling and Resource Sharing

Add code
Dec 19, 2025
Viaarxiv icon