Alert button
Picture for Owen Oertell

Owen Oertell

Alert button

Dataset Reset Policy Optimization for RLHF

Add code
Bookmark button
Alert button
Apr 16, 2024
Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

Viaarxiv icon

RL for Consistency Models: Faster Reward Guided Text-to-Image Generation

Add code
Bookmark button
Alert button
Mar 25, 2024
Owen Oertell, Jonathan D. Chang, Yiyi Zhang, Kianté Brantley, Wen Sun

Viaarxiv icon

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 11, 2024
Kaiwen Wang, Owen Oertell, Alekh Agarwal, Nathan Kallus, Wen Sun

Viaarxiv icon