Craig Sherstan

Sony AI

Value Function Decomposition for Iterative Design of Reinforcement Learning Agents

Jun 24, 2022
James MacGlashan, Evan Archer, Alisa Devlic, Takuma Seno, Craig Sherstan, Peter R. Wurman, Peter Stone

Work in Progress: Temporally Extended Auxiliary Tasks

Apr 16, 2020
Craig Sherstan, Bilal Kartal, Pablo Hernandez-Leal, Matthew E. Taylor

Gamma-Nets: Generalizing Value Estimation over Timescale

Nov 23, 2019
Craig Sherstan, Shibhansh Dohare, James MacGlashan, Johannes Günther, Patrick M. Pilarski

Accelerating Learning in Constructive Predictive Frameworks with the Successor Representation

Mar 23, 2018
Craig Sherstan, Marlos C. Machado, Patrick M. Pilarski

Directly Estimating the Variance of the λ-Return Using Temporal-Difference Methods

Feb 14, 2018
Craig Sherstan, Brendan Bennett, Kenny Young, Dylan R. Ashley, Adam White, Martha White, Richard S. Sutton

Communicative Capital for Prosthetic Agents

Nov 10, 2017
Patrick M. Pilarski, Richard S. Sutton, Kory W. Mathewson, Craig Sherstan, Adam S. R. Parker, Ann L. Edwards
