Picture for Pinyi Zhang

Pinyi Zhang

Reward Modeling from Natural Language Human Feedback

Add code
Jan 12, 2026
Viaarxiv icon