Alert button
Picture for Chris Nota

Chris Nota

Alert button

On the Convergence of Discounted Policy Gradient Methods

Add code
Bookmark button
Alert button
Jan 09, 2023
Chris Nota

Viaarxiv icon

Learning Reusable Options for Multi-Task Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 06, 2020
Francisco M. Garcia, Chris Nota, Philip S. Thomas

Figure 1 for Learning Reusable Options for Multi-Task Reinforcement Learning
Figure 2 for Learning Reusable Options for Multi-Task Reinforcement Learning
Figure 3 for Learning Reusable Options for Multi-Task Reinforcement Learning
Figure 4 for Learning Reusable Options for Multi-Task Reinforcement Learning
Viaarxiv icon

Is the Policy Gradient a Gradient?

Add code
Bookmark button
Alert button
Jun 17, 2019
Chris Nota, Philip S. Thomas

Figure 1 for Is the Policy Gradient a Gradient?
Figure 2 for Is the Policy Gradient a Gradient?
Figure 3 for Is the Policy Gradient a Gradient?
Viaarxiv icon

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

Add code
Bookmark button
Alert button
Jun 06, 2019
Philip S. Thomas, Scott M. Jordan, Yash Chandak, Chris Nota, James Kostas

Viaarxiv icon

Lifelong Learning with a Changing Action Set

Add code
Bookmark button
Alert button
Jun 05, 2019
Yash Chandak, Georgios Theocharous, Chris Nota, Philip S. Thomas

Figure 1 for Lifelong Learning with a Changing Action Set
Figure 2 for Lifelong Learning with a Changing Action Set
Figure 3 for Lifelong Learning with a Changing Action Set
Figure 4 for Lifelong Learning with a Changing Action Set
Viaarxiv icon

Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock

Add code
Bookmark button
Alert button
Feb 21, 2019
James Kostas, Chris Nota, Philip S. Thomas

Figure 1 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Figure 2 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Figure 3 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Figure 4 for Asynchronous Coagent Networks: Stochastic Networks for Reinforcement Learning without Backpropagation or a Clock
Viaarxiv icon