Picture for Daniel J. Mankowitz

Daniel J. Mankowitz

Robust Reinforcement Learning for Continuous Control with Model Misspecification

Add code
Jun 18, 2019
Figure 1 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Figure 2 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Figure 3 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Figure 4 for Robust Reinforcement Learning for Continuous Control with Model Misspecification
Viaarxiv icon

Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces

Add code
May 23, 2019
Figure 1 for Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
Figure 2 for Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
Figure 3 for Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
Figure 4 for Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
Viaarxiv icon

Soft-Robust Actor-Critic Policy-Gradient

Add code
Oct 24, 2018
Figure 1 for Soft-Robust Actor-Critic Policy-Gradient
Figure 2 for Soft-Robust Actor-Critic Policy-Gradient
Viaarxiv icon

Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning

Add code
Sep 06, 2018
Figure 1 for Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Figure 2 for Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Figure 3 for Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Figure 4 for Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning
Viaarxiv icon

Unicorn: Continual Learning with a Universal, Off-policy Agent

Add code
Jul 03, 2018
Figure 1 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Figure 2 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Figure 3 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Figure 4 for Unicorn: Continual Learning with a Universal, Off-policy Agent
Viaarxiv icon

Reward Constrained Policy Optimization

Add code
May 28, 2018
Figure 1 for Reward Constrained Policy Optimization
Figure 2 for Reward Constrained Policy Optimization
Figure 3 for Reward Constrained Policy Optimization
Figure 4 for Reward Constrained Policy Optimization
Viaarxiv icon

Learning Robust Options

Add code
Feb 09, 2018
Figure 1 for Learning Robust Options
Figure 2 for Learning Robust Options
Figure 3 for Learning Robust Options
Viaarxiv icon

Situationally Aware Options

Add code
Nov 20, 2017
Figure 1 for Situationally Aware Options
Figure 2 for Situationally Aware Options
Figure 3 for Situationally Aware Options
Figure 4 for Situationally Aware Options
Viaarxiv icon

Shallow Updates for Deep Reinforcement Learning

Add code
Nov 02, 2017
Figure 1 for Shallow Updates for Deep Reinforcement Learning
Figure 2 for Shallow Updates for Deep Reinforcement Learning
Figure 3 for Shallow Updates for Deep Reinforcement Learning
Figure 4 for Shallow Updates for Deep Reinforcement Learning
Viaarxiv icon

A Deep Hierarchical Approach to Lifelong Learning in Minecraft

Add code
Nov 30, 2016
Figure 1 for A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Figure 2 for A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Figure 3 for A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Figure 4 for A Deep Hierarchical Approach to Lifelong Learning in Minecraft
Viaarxiv icon