Ning Ding

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Sep 11, 2025
Via arXiv

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Sep 10, 2025
Via arXiv

A Survey of Reinforcement Learning for Large Reasoning Models

Sep 10, 2025
Via arXiv

Towards a Unified View of Large Language Model Post-Training

Sep 04, 2025
Via arXiv

Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance

Aug 28, 2025
Via arXiv

Evaluating Movement Initiation Timing in Ultimate Frisbee via Temporal Counterfactuals

Aug 25, 2025
Via arXiv

EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization

Jun 16, 2025
Via arXiv

Farseer: A Refined Scaling Law in Large Language Models

Jun 12, 2025
Via arXiv

MiniCPM4: Ultra-Efficient LLMs on End Devices

Jun 09, 2025
Via arXiv

Automating Exploratory Multiomics Research via Language Models

Jun 09, 2025
Via arXiv