Ning Ding

Toward Efficient Agents: Memory, Tool learning, and Planning

Jan 20, 2026

M3DDM+: An improved video outpainting by a modified masking strategy

Jan 16, 2026

Emotion-Director: Bridging Affective Shortcut in Emotion-Oriented Image Generation

Dec 22, 2025

JustRL: Scaling a 1.5B LLM with a Simple RL Recipe

Dec 18, 2025

Accurate de novo sequencing of the modified proteome with OmniNovo

Dec 13, 2025

P1: Mastering Physics Olympiads with Reinforcement Learning

Nov 17, 2025

W2S-AlignTree: Weak-to-Strong Inference-Time Alignment for Large Language Models via Monte Carlo Tree Search

Nov 14, 2025

Context and Diversity Matter: The Emergence of In-Context Learning in World Models

Sep 26, 2025

FlowRL: Matching Reward Distributions for LLM Reasoning

Sep 18, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Sep 11, 2025