Wenhao Zhan

Dataset Reset Policy Optimization for RLHF

Apr 16, 2024
Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

Via arXiv

Optimal Multi-Distribution Learning

Dec 08, 2023
Zihan Zhang, Wenhao Zhan, Yuxin Chen, Simon S. Du, Jason D. Lee

Via arXiv

Provably Efficient CVaR RL in Low-rank MDPs

Nov 20, 2023
Yulai Zhao, Wenhao Zhan, Xiaoyan Hu, Ho-fung Leung, Farzan Farnia, Wen Sun, Jason D. Lee

Via arXiv

How to Query Human Feedback Efficiently in RL?

May 29, 2023
Wenhao Zhan, Masatoshi Uehara, Wen Sun, Jason D. Lee

Via arXiv

Provable Offline Reinforcement Learning with Human Feedback

May 24, 2023
Wenhao Zhan, Masatoshi Uehara, Nathan Kallus, Jason D. Lee, Wen Sun

Via arXiv

Reward-agnostic Fine-tuning: Provable Statistical Benefits of Hybrid Reinforcement Learning

May 17, 2023
Gen Li, Wenhao Zhan, Jason D. Lee, Yuejie Chi, Yuxin Chen

Via arXiv

PAC Reinforcement Learning for Predictive State Representations

Jul 15, 2022
Wenhao Zhan, Masatoshi Uehara, Wen Sun, Jason D. Lee

Via arXiv

Decentralized Optimistic Hyperpolicy Mirror Descent: Provably No-Regret Learning in Markov Games

Jun 03, 2022
Wenhao Zhan, Jason D. Lee, Zhuoran Yang

Via arXiv

Offline Reinforcement Learning with Realizability and Single-policy Concentrability

Feb 11, 2022
Wenhao Zhan, Baihe Huang, Audrey Huang, Nan Jiang, Jason D. Lee

Via arXiv