Picture for Ning Ding

Ning Ding

How Far Can Unsupervised RLVR Scale LLM Training?

Add code
Mar 09, 2026
Viaarxiv icon

Heterogeneous Agent Collaborative Reinforcement Learning

Add code
Mar 03, 2026
Viaarxiv icon

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Add code
Feb 10, 2026
Viaarxiv icon

MeKi: Memory-based Expert Knowledge Injection for Efficient LLM Scaling

Add code
Feb 03, 2026
Viaarxiv icon

Toward Efficient Agents: Memory, Tool learning, and Planning

Add code
Jan 20, 2026
Viaarxiv icon

M3DDM+: An improved video outpainting by a modified masking strategy

Add code
Jan 16, 2026
Viaarxiv icon

Emotion-Director: Bridging Affective Shortcut in Emotion-Oriented Image Generation

Add code
Dec 22, 2025
Viaarxiv icon

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Add code
Dec 18, 2025
Viaarxiv icon

Accurate de novo sequencing of the modified proteome with OmniNovo

Add code
Dec 13, 2025
Viaarxiv icon

P1: Mastering Physics Olympiads with Reinforcement Learning

Add code
Nov 17, 2025
Viaarxiv icon