Mingcong Song

Metis: Bridging Text and Code Memory for Self-Evolving Agents

Jun 23, 2026

LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding

Dec 22, 2025

Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels

Dec 24, 2024