Marcus Hutter

Universal Reinforcement Learning Algorithms: Survey and Experiments

May 30, 2017
John Aslanides, Jan Leike, Marcus Hutter

Via arXiv

Generalised Discount Functions applied to a Monte-Carlo AImu Implementation

Mar 03, 2017
Sean Lamont, John Aslanides, Jan Leike, Marcus Hutter

Via arXiv

Free Lunch for Optimisation under the Universal Distribution

Aug 16, 2016
Tom Everitt, Tor Lattimore, Marcus Hutter

Via arXiv

Thompson Sampling is Asymptotically Optimal in General Environments

Jun 03, 2016
Jan Leike, Tor Lattimore, Laurent Orseau, Marcus Hutter

Via arXiv

Death and Suicide in Universal Artificial Intelligence

Jun 02, 2016
Jarryd Martin, Tom Everitt, Marcus Hutter

Via arXiv

Avoiding Wireheading with Value Reinforcement Learning

May 10, 2016
Tom Everitt, Marcus Hutter

Via arXiv

Self-Modification of Policy and Utility Function in Rational Agents

May 10, 2016
Tom Everitt, Daniel Filan, Mayank Daswani, Marcus Hutter

Via arXiv

Loss Bounds and Time Complexity for Speed Priors

Apr 12, 2016
Daniel Filan, Marcus Hutter, Jan Leike

Via arXiv

On the Computability of AIXI

Oct 19, 2015
Jan Leike, Marcus Hutter

Via arXiv

Bad Universal Priors and Notions of Optimality

Oct 16, 2015
Jan Leike, Marcus Hutter

Via arXiv