Picture for Younjoo Lee

Younjoo Lee

DyLLM: Efficient Diffusion LLM Inference via Saliency-based Token Selection and Partial Attention

Add code
Mar 09, 2026
Viaarxiv icon

From Tokens to Layers: Redefining Stall-Free Scheduling for LLM Serving with Layered Prefill

Add code
Oct 09, 2025
Viaarxiv icon