Ning Ding

Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance

Aug 28, 2025
Via arXiv

Evaluating Movement Initiation Timing in Ultimate Frisbee via Temporal Counterfactuals

Aug 25, 2025
Via arXiv

EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization

Jun 16, 2025
Via arXiv

Farseer: A Refined Scaling Law in Large Language Models

Jun 12, 2025
Via arXiv

MiniCPM4: Ultra-Efficient LLMs on End Devices

Jun 09, 2025
Via arXiv

Learning to Focus: Causal Attention Distillation via Gradient-Guided Token Pruning

Jun 09, 2025
Via arXiv

Automating Exploratory Multiomics Research via Language Models

Jun 09, 2025
Via arXiv

The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models

May 28, 2025
Via arXiv

Self-Reflective Reinforcement Learning for Diffusion-based Image Reasoning Generation

May 28, 2025
Via arXiv

The Overthinker's DIET: Cutting Token Calories with DIfficulty-AwarE Training

May 25, 2025
Via arXiv