Runpeng Dai

G-Zero: Self-Play for Open-Ended Generation from Zero Data

May 11, 2026

DeltaRubric: Generative Multimodal Reward Modeling via Joint Planning and Verification

May 10, 2026

Reinforcing Multimodal Reasoning Against Visual Degradation

May 10, 2026

Parallel-Probe: Towards Efficient Parallel Thinking via 2D Probing

Feb 03, 2026

StatEval: A Comprehensive Benchmark for Large Language Models in Statistics

Oct 10, 2025

VOGUE: Guiding Exploration with Visual Uncertainty Improves Multimodal Reasoning

Oct 01, 2025

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models

Sep 11, 2025

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Sep 09, 2025

Deep Distributional Learning with Non-crossing Quantile Network

Apr 11, 2025

Spatio-temporal Prediction of Fine-Grained Origin-Destination Matrices with Applications in Ridesharing

Mar 31, 2025