Alert button
Picture for James Kostas

James Kostas

Alert button

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

Add code
Bookmark button
Alert button
Jun 06, 2019
Philip S. Thomas, Scott M. Jordan, Yash Chandak, Chris Nota, James Kostas

Viaarxiv icon

Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock

Add code
Bookmark button
Alert button
Feb 21, 2019
James Kostas, Chris Nota, Philip S. Thomas

Figure 1 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Figure 2 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Figure 3 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Figure 4 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Viaarxiv icon

Learning Action Representations for Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 01, 2019
Yash Chandak, Georgios Theocharous, James Kostas, Scott Jordan, Philip S. Thomas

Figure 1 for Learning Action Representations for Reinforcement Learning
Figure 2 for Learning Action Representations for Reinforcement Learning
Figure 3 for Learning Action Representations for Reinforcement Learning
Figure 4 for Learning Action Representations for Reinforcement Learning
Viaarxiv icon