Zhuoyuan Hao

Reproducing, Analyzing, and Detecting Reward Hacking in Rubric-Based Reinforcement Learning

Jun 03, 2026
Via arXiv

Echoes as Anchors: Probabilistic Costs and Attention Refocusing in LLM Reasoning

Feb 06, 2026
Via arXiv