Picture for Satinder Singh

Satinder Singh

Value Prediction Network

Add code
Nov 06, 2017
Figure 1 for Value Prediction Network
Figure 2 for Value Prediction Network
Figure 3 for Value Prediction Network
Figure 4 for Value Prediction Network
Viaarxiv icon

Repeated Inverse Reinforcement Learning

Add code
Nov 04, 2017
Viaarxiv icon

Minimizing Maximum Regret in Commitment Constrained Sequential Decision Making

Add code
Mar 14, 2017
Figure 1 for Minimizing Maximum Regret in Commitment Constrained Sequential Decision Making
Figure 2 for Minimizing Maximum Regret in Commitment Constrained Sequential Decision Making
Figure 3 for Minimizing Maximum Regret in Commitment Constrained Sequential Decision Making
Figure 4 for Minimizing Maximum Regret in Commitment Constrained Sequential Decision Making
Viaarxiv icon

Control of Memory, Active Perception, and Action in Minecraft

Add code
May 30, 2016
Figure 1 for Control of Memory, Active Perception, and Action in Minecraft
Figure 2 for Control of Memory, Active Perception, and Action in Minecraft
Figure 3 for Control of Memory, Active Perception, and Action in Minecraft
Figure 4 for Control of Memory, Active Perception, and Action in Minecraft
Viaarxiv icon

Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games

Add code
Apr 24, 2016
Figure 1 for Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Figure 2 for Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Figure 3 for Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Figure 4 for Deep Learning for Reward Design to Improve Monte Carlo Tree Search in ATARI Games
Viaarxiv icon

Towards Resolving Unidentifiability in Inverse Reinforcement Learning

Add code
Jan 25, 2016
Figure 1 for Towards Resolving Unidentifiability in Inverse Reinforcement Learning
Figure 2 for Towards Resolving Unidentifiability in Inverse Reinforcement Learning
Figure 3 for Towards Resolving Unidentifiability in Inverse Reinforcement Learning
Figure 4 for Towards Resolving Unidentifiability in Inverse Reinforcement Learning
Viaarxiv icon

Action-Conditional Video Prediction using Deep Networks in Atari Games

Add code
Dec 22, 2015
Figure 1 for Action-Conditional Video Prediction using Deep Networks in Atari Games
Figure 2 for Action-Conditional Video Prediction using Deep Networks in Atari Games
Figure 3 for Action-Conditional Video Prediction using Deep Networks in Atari Games
Figure 4 for Action-Conditional Video Prediction using Deep Networks in Atari Games
Viaarxiv icon

Graphical Models for Game Theory

Add code
Mar 08, 2015
Figure 1 for Graphical Models for Game Theory
Figure 2 for Graphical Models for Game Theory
Viaarxiv icon

Learning to Make Predictions In Partially Observable Environments Without a Generative Model

Add code
Jan 16, 2014
Figure 1 for Learning to Make Predictions In Partially Observable Environments Without a Generative Model
Figure 2 for Learning to Make Predictions In Partially Observable Environments Without a Generative Model
Figure 3 for Learning to Make Predictions In Partially Observable Environments Without a Generative Model
Figure 4 for Learning to Make Predictions In Partially Observable Environments Without a Generative Model
Viaarxiv icon

Approximate Planning for Factored POMDPs using Belief State Simplification

Add code
Jan 23, 2013
Viaarxiv icon