Alert button
Picture for Wen Sun

Wen Sun

Alert button

Dataset Reset Policy Optimization for RLHF

Add code
Bookmark button
Alert button
Apr 15, 2024
Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

Viaarxiv icon

Adversarial Imitation Learning via Boosting

Add code
Bookmark button
Alert button
Apr 12, 2024
Jonathan D. Chang, Dhruv Sreenivas, Yingbing Huang, Kianté Brantley, Wen Sun

Viaarxiv icon

Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes

Add code
Bookmark button
Alert button
Mar 29, 2024
Andrew Bennett, Nathan Kallus, Miruna Oprescu, Wen Sun, Kaiwen Wang

Viaarxiv icon

RL for Consistency Models: Faster Reward Guided Text-to-Image Generation

Add code
Bookmark button
Alert button
Mar 25, 2024
Owen Oertell, Jonathan D. Chang, Yiyi Zhang, Kianté Brantley, Wen Sun

Viaarxiv icon

Risk-Sensitive RL with Optimized Certainty Equivalents via Reduction to Standard RL

Add code
Bookmark button
Alert button
Mar 10, 2024
Kaiwen Wang, Dawen Liang, Nathan Kallus, Wen Sun

Figure 1 for Risk-Sensitive RL with Optimized Certainty Equivalents via Reduction to Standard RL
Figure 2 for Risk-Sensitive RL with Optimized Certainty Equivalents via Reduction to Standard RL
Figure 3 for Risk-Sensitive RL with Optimized Certainty Equivalents via Reduction to Standard RL
Viaarxiv icon

Koopman-Assisted Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 04, 2024
Preston Rozwood, Edward Mehrez, Ludger Paehler, Wen Sun, Steven L. Brunton

Figure 1 for Koopman-Assisted Reinforcement Learning
Figure 2 for Koopman-Assisted Reinforcement Learning
Figure 3 for Koopman-Assisted Reinforcement Learning
Figure 4 for Koopman-Assisted Reinforcement Learning
Viaarxiv icon

More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 11, 2024
Kaiwen Wang, Owen Oertell, Alekh Agarwal, Nathan Kallus, Wen Sun

Viaarxiv icon

Provably Efficient CVaR RL in Low-rank MDPs

Add code
Bookmark button
Alert button
Nov 20, 2023
Yulai Zhao, Wenhao Zhan, Xiaoyan Hu, Ho-fung Leung, Farzan Farnia, Wen Sun, Jason D. Lee

Viaarxiv icon

Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees

Add code
Bookmark button
Alert button
Nov 14, 2023
Yifei Zhou, Ayush Sekhari, Yuda Song, Wen Sun

Viaarxiv icon

Reward Finetuning for Faster and More Accurate Unsupervised Object Discovery

Add code
Bookmark button
Alert button
Nov 05, 2023
Katie Z Luo, Zhenzhen Liu, Xiangyu Chen, Yurong You, Sagie Benaim, Cheng Perng Phoo, Mark Campbell, Wen Sun, Bharath Hariharan, Kilian Q. Weinberger

Viaarxiv icon