Lufei Li

Grad2Reward: From Sparse Judgment to Dense Rewards for Improving Open-Ended LLM Reasoning

Feb 02, 2026
Via arXiv