Picture for Peter Sunehag

Peter Sunehag

Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions

Add code
Dec 16, 2015
Figure 1 for Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
Figure 2 for Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
Figure 3 for Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
Figure 4 for Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
Viaarxiv icon

The Sample-Complexity of General Reinforcement Learning

Add code
Aug 22, 2013
Figure 1 for The Sample-Complexity of General Reinforcement Learning
Figure 2 for The Sample-Complexity of General Reinforcement Learning
Figure 3 for The Sample-Complexity of General Reinforcement Learning
Viaarxiv icon

On Nicod's Condition, Rules of Induction and the Raven Paradox

Add code
Jul 16, 2013
Figure 1 for On Nicod's Condition, Rules of Induction and the Raven Paradox
Viaarxiv icon

Concentration and Confidence for Discrete Bayesian Sequence Predictors

Add code
Jun 29, 2013
Figure 1 for Concentration and Confidence for Discrete Bayesian Sequence Predictors
Figure 2 for Concentration and Confidence for Discrete Bayesian Sequence Predictors
Figure 3 for Concentration and Confidence for Discrete Bayesian Sequence Predictors
Viaarxiv icon

Optimistic Agents are Asymptotically Optimal

Add code
Sep 29, 2012
Viaarxiv icon

Adaptive Context Tree Weighting

Add code
Jan 10, 2012
Figure 1 for Adaptive Context Tree Weighting
Figure 2 for Adaptive Context Tree Weighting
Figure 3 for Adaptive Context Tree Weighting
Figure 4 for Adaptive Context Tree Weighting
Viaarxiv icon

Principles of Solomonoff Induction and AIXI

Add code
Nov 25, 2011
Viaarxiv icon

Feature Reinforcement Learning In Practice

Add code
Aug 18, 2011
Figure 1 for Feature Reinforcement Learning In Practice
Figure 2 for Feature Reinforcement Learning In Practice
Figure 3 for Feature Reinforcement Learning In Practice
Figure 4 for Feature Reinforcement Learning In Practice
Viaarxiv icon

Axioms for Rational Reinforcement Learning

Add code
Jul 27, 2011
Viaarxiv icon

Consistency of Feature Markov Processes

Add code
Jul 13, 2010
Viaarxiv icon