Picture for Ao Lu

Ao Lu

Grad2Reward: From Sparse Judgment to Dense Rewards for Improving Open-Ended LLM Reasoning

Add code
Feb 02, 2026
Viaarxiv icon