Zhizhao Zeng

Rectify Evaluation Preference: Improving LLMs' Critique on Math Reasoning via Perplexity-aware Reinforcement Learning

Nov 13, 2025
Via arXiv