Alert button
Picture for James MacGlashan

James MacGlashan

Alert button

Sony AI

Value Function Decomposition for Iterative Design of Reinforcement Learning Agents

Add code
Bookmark button
Alert button
Jun 24, 2022
James MacGlashan, Evan Archer, Alisa Devlic, Takuma Seno, Craig Sherstan, Peter R. Wurman, Peter Stone

Figure 1 for Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
Figure 2 for Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
Figure 3 for Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
Figure 4 for Value Function Decomposition for Iterative Design of Reinforcement Learning Agents
Viaarxiv icon

Gamma-Nets: Generalizing Value Estimation over Timescale

Add code
Bookmark button
Alert button
Nov 23, 2019
Craig Sherstan, Shibhansh Dohare, James MacGlashan, Johannes Günther, Patrick M. Pilarski

Figure 1 for Gamma-Nets: Generalizing Value Estimation over Timescale
Figure 2 for Gamma-Nets: Generalizing Value Estimation over Timescale
Figure 3 for Gamma-Nets: Generalizing Value Estimation over Timescale
Figure 4 for Gamma-Nets: Generalizing Value Estimation over Timescale
Viaarxiv icon

Implementing the Deep Q-Network

Add code
Bookmark button
Alert button
Nov 20, 2017
Melrose Roderick, James MacGlashan, Stefanie Tellex

Figure 1 for Implementing the Deep Q-Network
Figure 2 for Implementing the Deep Q-Network
Figure 3 for Implementing the Deep Q-Network
Viaarxiv icon

Environment-Independent Task Specifications via GLTL

Add code
Bookmark button
Alert button
Apr 14, 2017
Michael L. Littman, Ufuk Topcu, Jie Fu, Charles Isbell, Min Wen, James MacGlashan

Figure 1 for Environment-Independent Task Specifications via GLTL
Figure 2 for Environment-Independent Task Specifications via GLTL
Figure 3 for Environment-Independent Task Specifications via GLTL
Figure 4 for Environment-Independent Task Specifications via GLTL
Viaarxiv icon

Interactive Learning from Policy-Dependent Human Feedback

Add code
Bookmark button
Alert button
Jan 21, 2017
James MacGlashan, Mark K Ho, Robert Loftin, Bei Peng, David Roberts, Matthew E. Taylor, Michael L. Littman

Figure 1 for Interactive Learning from Policy-Dependent Human Feedback
Figure 2 for Interactive Learning from Policy-Dependent Human Feedback
Viaarxiv icon