Alert button
Picture for Peter Sunehag

Peter Sunehag

Alert button

Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions

Add code
Bookmark button
Alert button
Dec 16, 2015
Peter Sunehag, Richard Evans, Gabriel Dulac-Arnold, Yori Zwols, Daniel Visentin, Ben Coppin

Figure 1 for Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
Figure 2 for Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
Figure 3 for Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
Figure 4 for Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
Viaarxiv icon

The Sample-Complexity of General Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 22, 2013
Tor Lattimore, Marcus Hutter, Peter Sunehag

Figure 1 for The Sample-Complexity of General Reinforcement Learning
Figure 2 for The Sample-Complexity of General Reinforcement Learning
Figure 3 for The Sample-Complexity of General Reinforcement Learning
Viaarxiv icon

On Nicod's Condition, Rules of Induction and the Raven Paradox

Add code
Bookmark button
Alert button
Jul 16, 2013
Hadi Mohasel Afshar, Peter Sunehag

Figure 1 for On Nicod's Condition, Rules of Induction and the Raven Paradox
Viaarxiv icon

Concentration and Confidence for Discrete Bayesian Sequence Predictors

Add code
Bookmark button
Alert button
Jun 29, 2013
Tor Lattimore, Marcus Hutter, Peter Sunehag

Figure 1 for Concentration and Confidence for Discrete Bayesian Sequence Predictors
Figure 2 for Concentration and Confidence for Discrete Bayesian Sequence Predictors
Figure 3 for Concentration and Confidence for Discrete Bayesian Sequence Predictors
Viaarxiv icon

Optimistic Agents are Asymptotically Optimal

Add code
Bookmark button
Alert button
Sep 29, 2012
Peter Sunehag, Marcus Hutter

Viaarxiv icon

Adaptive Context Tree Weighting

Add code
Bookmark button
Alert button
Jan 10, 2012
Alexander O'Neill, Marcus Hutter, Wen Shao, Peter Sunehag

Figure 1 for Adaptive Context Tree Weighting
Figure 2 for Adaptive Context Tree Weighting
Figure 3 for Adaptive Context Tree Weighting
Figure 4 for Adaptive Context Tree Weighting
Viaarxiv icon

Principles of Solomonoff Induction and AIXI

Add code
Bookmark button
Alert button
Nov 25, 2011
Peter Sunehag, Marcus Hutter

Viaarxiv icon

Feature Reinforcement Learning In Practice

Add code
Bookmark button
Alert button
Aug 18, 2011
Phuong Nguyen, Peter Sunehag, Marcus Hutter

Figure 1 for Feature Reinforcement Learning In Practice
Figure 2 for Feature Reinforcement Learning In Practice
Figure 3 for Feature Reinforcement Learning In Practice
Figure 4 for Feature Reinforcement Learning In Practice
Viaarxiv icon

Axioms for Rational Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 27, 2011
Peter Sunehag, Marcus Hutter

Viaarxiv icon

Consistency of Feature Markov Processes

Add code
Bookmark button
Alert button
Jul 13, 2010
Peter Sunehag, Marcus Hutter

Viaarxiv icon