Picture for Ning Ding

Ning Ding

Context and Diversity Matter: The Emergence of In-Context Learning in World Models

Add code
Sep 26, 2025
Viaarxiv icon

FlowRL: Matching Reward Distributions for LLM Reasoning

Add code
Sep 18, 2025
Viaarxiv icon

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Add code
Sep 11, 2025
Viaarxiv icon

A Survey of Reinforcement Learning for Large Reasoning Models

Add code
Sep 10, 2025
Viaarxiv icon

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Add code
Sep 10, 2025
Figure 1 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 2 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 3 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Figure 4 for HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
Viaarxiv icon

Towards a Unified View of Large Language Model Post-Training

Add code
Sep 04, 2025
Viaarxiv icon

Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance

Add code
Aug 28, 2025
Viaarxiv icon

Evaluating Movement Initiation Timing in Ultimate Frisbee via Temporal Counterfactuals

Add code
Aug 25, 2025
Viaarxiv icon

EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization

Add code
Jun 16, 2025
Viaarxiv icon

Farseer: A Refined Scaling Law in Large Language Models

Add code
Jun 12, 2025
Figure 1 for Farseer: A Refined Scaling Law in Large Language Models
Figure 2 for Farseer: A Refined Scaling Law in Large Language Models
Figure 3 for Farseer: A Refined Scaling Law in Large Language Models
Figure 4 for Farseer: A Refined Scaling Law in Large Language Models
Viaarxiv icon