Alert button
Picture for Wesley Chung

Wesley Chung

Alert button

The Role of Baselines in Policy Gradient Optimization

Add code
Bookmark button
Alert button
Jan 16, 2023
Jincheng Mei, Wesley Chung, Valentin Thomas, Bo Dai, Csaba Szepesvari, Dale Schuurmans

Figure 1 for The Role of Baselines in Policy Gradient Optimization
Figure 2 for The Role of Baselines in Policy Gradient Optimization
Figure 3 for The Role of Baselines in Policy Gradient Optimization
Viaarxiv icon

Beyond variance reduction: Understanding the true impact of baselines on policy optimization

Add code
Bookmark button
Alert button
Aug 31, 2020
Wesley Chung, Valentin Thomas, Marlos C. Machado, Nicolas Le Roux

Figure 1 for Beyond variance reduction: Understanding the true impact of baselines on policy optimization
Figure 2 for Beyond variance reduction: Understanding the true impact of baselines on policy optimization
Figure 3 for Beyond variance reduction: Understanding the true impact of baselines on policy optimization
Figure 4 for Beyond variance reduction: Understanding the true impact of baselines on policy optimization
Viaarxiv icon

Incrementally Learning Functions of the Return

Add code
Bookmark button
Alert button
Jul 05, 2019
Brendan Bennett, Wesley Chung, Muhammad Zaheer, Vincent Liu

Figure 1 for Incrementally Learning Functions of the Return
Figure 2 for Incrementally Learning Functions of the Return
Viaarxiv icon

Importance Resampling for Off-policy Prediction

Add code
Bookmark button
Alert button
Jun 11, 2019
Matthew Schlegel, Wesley Chung, Daniel Graves, Jian Qian, Martha White

Figure 1 for Importance Resampling for Off-policy Prediction
Figure 2 for Importance Resampling for Off-policy Prediction
Figure 3 for Importance Resampling for Off-policy Prediction
Figure 4 for Importance Resampling for Off-policy Prediction
Viaarxiv icon

High-confidence error estimates for learned value functions

Add code
Bookmark button
Alert button
Aug 28, 2018
Touqir Sajed, Wesley Chung, Martha White

Figure 1 for High-confidence error estimates for learned value functions
Figure 2 for High-confidence error estimates for learned value functions
Viaarxiv icon