Alert button

Provable Multi-Party Reinforcement Learning with Diverse Human Feedback

Mar 08, 2024
Huiying Zhong, Zhun Deng, Weijie J. Su, Zhiwei Steven Wu, Linjun Zhang

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: