Zheng Wen

RLHF and IIA: Perverse Incentives

Dec 02, 2023
Wanqiao Xu, Shi Dong, Xiuyuan Lu, Grace Lam, Zheng Wen, Benjamin Van Roy

Efficient Online Learning with Offline Datasets for Infinite Horizon MDPs: A Bayesian Approach

Oct 17, 2023
Dengwang Tang, Rahul Jain, Botao Hao, Zheng Wen

Bridging Imitation and Online Reinforcement Learning: An Optimistic Tale

Mar 20, 2023
Botao Hao, Rahul Jain, Dengwang Tang, Zheng Wen

Approximate Thompson Sampling via Epistemic Neural Networks

Feb 18, 2023
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

Leveraging Demonstrations to Improve Online Learning: Quality Matters

Feb 08, 2023
Botao Hao, Rahul Jain, Tor Lattimore, Benjamin Van Roy, Zheng Wen

Robustness of Epinets against Distributional Shifts

Jul 01, 2022
Xiuyuan Lu, Ian Osband, Seyed Mohammad Asghari, Sven Gowal, Vikranth Dwaracherla, Zheng Wen, Benjamin Van Roy

Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping

Jun 08, 2022
Vikranth Dwaracherla, Zheng Wen, Ian Osband, Xiuyuan Lu, Seyed Mohammad Asghari, Benjamin Van Roy

An Analysis of Ensemble Sampling

Mar 02, 2022
Chao Qin, Zheng Wen, Xiuyuan Lu, Benjamin Van Roy

Evaluating High-Order Predictive Distributions in Deep Learning

Feb 28, 2022
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Xiuyuan Lu, Benjamin Van Roy
