Brendan Bennett

Back to Square One: Superhuman Performance in Chutes and Ladders Through Deep Neural Networks and Tree Search

Apr 01, 2021
Dylan Ashley, Anssi Kanervisto, Brendan Bennett

Incrementally Learning Functions of the Return

Jul 05, 2019
Brendan Bennett, Wesley Chung, Muhammad Zaheer, Vincent Liu

Predicting Periodicity with Temporal Difference Learning

Sep 20, 2018
Kristopher De Asis, Brendan Bennett, Richard S. Sutton

Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods

Feb 14, 2018
Craig Sherstan, Brendan Bennett, Kenny Young, Dylan R. Ashley, Adam White, Martha White, Richard S. Sutton
