DRLC: Reinforcement Learning with Dense Rewards from LLM Critic

Jan 14, 2024
Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng
