Ning Ding

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

Apr 14, 2026
Via arXiv

BFMD: A Full-Match Badminton Dense Dataset for Dense Shot Captioning

Mar 26, 2026
Via arXiv

StateLinFormer: Stateful Training Enhancing Long-term Memory in Navigation

Mar 24, 2026
Via arXiv

AI Can Learn Scientific Taste

Mar 15, 2026
Via arXiv

How Far Can Unsupervised RLVR Scale LLM Training?

Mar 09, 2026
Via arXiv

Heterogeneous Agent Collaborative Reinforcement Learning

Mar 03, 2026
Via arXiv

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Feb 10, 2026
Via arXiv

MeKi: Memory-based Expert Knowledge Injection for Efficient LLM Scaling

Feb 03, 2026
Via arXiv

Toward Efficient Agents: Memory, Tool learning, and Planning

Jan 20, 2026
Via arXiv

M3DDM+: An improved video outpainting by a modified masking strategy

Jan 16, 2026
Via arXiv