Picture for Philip S. Thomas

Philip S. Thomas

Reinforcement Learning for Strategic Recommendations

Add code
Sep 15, 2020
Figure 1 for Reinforcement Learning for Strategic Recommendations
Figure 2 for Reinforcement Learning for Strategic Recommendations
Figure 3 for Reinforcement Learning for Strategic Recommendations
Figure 4 for Reinforcement Learning for Strategic Recommendations
Viaarxiv icon

Evaluating the Performance of Reinforcement Learning Algorithms

Add code
Jun 30, 2020
Figure 1 for Evaluating the Performance of Reinforcement Learning Algorithms
Figure 2 for Evaluating the Performance of Reinforcement Learning Algorithms
Figure 3 for Evaluating the Performance of Reinforcement Learning Algorithms
Figure 4 for Evaluating the Performance of Reinforcement Learning Algorithms
Viaarxiv icon

Optimizing for the Future in Non-Stationary MDPs

Add code
Jun 02, 2020
Figure 1 for Optimizing for the Future in Non-Stationary MDPs
Figure 2 for Optimizing for the Future in Non-Stationary MDPs
Figure 3 for Optimizing for the Future in Non-Stationary MDPs
Figure 4 for Optimizing for the Future in Non-Stationary MDPs
Viaarxiv icon

Learning Reusable Options for Multi-Task Reinforcement Learning

Add code
Jan 06, 2020
Figure 1 for Learning Reusable Options for Multi-Task Reinforcement Learning
Figure 2 for Learning Reusable Options for Multi-Task Reinforcement Learning
Figure 3 for Learning Reusable Options for Multi-Task Reinforcement Learning
Figure 4 for Learning Reusable Options for Multi-Task Reinforcement Learning
Viaarxiv icon

Reinforcement learning with a network of spiking agents

Add code
Nov 10, 2019
Figure 1 for Reinforcement learning with a network of spiking agents
Figure 2 for Reinforcement learning with a network of spiking agents
Viaarxiv icon

Is the Policy Gradient a Gradient?

Add code
Jun 17, 2019
Figure 1 for Is the Policy Gradient a Gradient?
Figure 2 for Is the Policy Gradient a Gradient?
Figure 3 for Is the Policy Gradient a Gradient?
Viaarxiv icon

Classical Policy Gradient: Preserving Bellman's Principle of Optimality

Add code
Jun 06, 2019
Viaarxiv icon

Reinforcement Learning When All Actions are Not Always Available

Add code
Jun 05, 2019
Figure 1 for Reinforcement Learning When All Actions are Not Always Available
Figure 2 for Reinforcement Learning When All Actions are Not Always Available
Figure 3 for Reinforcement Learning When All Actions are Not Always Available
Figure 4 for Reinforcement Learning When All Actions are Not Always Available
Viaarxiv icon

Lifelong Learning with a Changing Action Set

Add code
Jun 05, 2019
Figure 1 for Lifelong Learning with a Changing Action Set
Figure 2 for Lifelong Learning with a Changing Action Set
Figure 3 for Lifelong Learning with a Changing Action Set
Figure 4 for Lifelong Learning with a Changing Action Set
Viaarxiv icon

A New Confidence Interval for the Mean of a Bounded Random Variable

Add code
May 15, 2019
Figure 1 for A New Confidence Interval for the Mean of a Bounded Random Variable
Figure 2 for A New Confidence Interval for the Mean of a Bounded Random Variable
Figure 3 for A New Confidence Interval for the Mean of a Bounded Random Variable
Figure 4 for A New Confidence Interval for the Mean of a Bounded Random Variable
Viaarxiv icon