Xunliang Cai

Alphabetical order by last name

Fast Catch-Up, Late Switching: Optimal Batch Size Scheduling via Functional Scaling Laws

Feb 15, 2026
Via arXiv

SnapMLA: Efficient Long-Context MLA Decoding via Hardware-Aware FP8 Quantized Pipelining

Feb 12, 2026
Via arXiv

AgentNoiseBench: Benchmarking Robustness of Tool-Using LLM Agents Under Noisy Condition

Feb 11, 2026
Via arXiv

Learning to Self-Verify Makes Language Models Better Reasoners

Feb 07, 2026
Via arXiv

ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training

Feb 06, 2026
Via arXiv

CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs

Feb 03, 2026
Via arXiv

Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory

Feb 03, 2026
Via arXiv

$V_0$: A Generalist Value Model for Any Policy at State Zero

Feb 03, 2026
Via arXiv

DIFFA-2: A Practical Diffusion Large Language Model for General Audio Understanding

Jan 30, 2026
Via arXiv

SpanNorm: Reconciling Training Stability and Performance in Deep Transformers

Jan 30, 2026
Via arXiv