Picture for Li Dong

Li Dong

Sherman

Data Efficacy for Language Model Training

Add code
Jun 26, 2025
Viaarxiv icon

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Add code
Jun 10, 2025
Viaarxiv icon

Reinforcement Pre-Training

Add code
Jun 09, 2025
Viaarxiv icon

Rectified Sparse Attention

Add code
Jun 05, 2025
Viaarxiv icon

On-Policy RL with Optimal Reward Baseline

Add code
May 29, 2025
Viaarxiv icon

From Large AI Models to Agentic AI: A Tutorial on Future Intelligent Communications

Add code
May 28, 2025
Viaarxiv icon

Think Only When You Need with Large Hybrid-Reasoning Models

Add code
May 21, 2025
Viaarxiv icon

Reward Reasoning Model

Add code
May 20, 2025
Viaarxiv icon

MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems

Add code
May 16, 2025
Viaarxiv icon

Model as a Game: On Numerical and Spatial Consistency for Generative Games

Add code
Mar 27, 2025
Viaarxiv icon