Picture for Xinzi Liu

Xinzi Liu

Reward Evolution with Graph-of-Thoughts: A Bi-Level Language Model Framework for Reinforcement Learning

Add code
Sep 19, 2025
Viaarxiv icon