Picture for Pei-Chi Pan

Pei-Chi Pan

Reward Modeling for Reinforcement Learning-Based LLM Reasoning: Design, Challenges, and Evaluation

Add code
Feb 10, 2026
Viaarxiv icon