Marcus Hutter

Reward Tampering Problems and Solutions in Reinforcement Learning: A Causal Influence Diagram Perspective

Aug 20, 2019
Tom Everitt, Marcus Hutter

Fairness without Regret

Jul 11, 2019
Marcus Hutter

Asymptotically Unambitious Artificial General Intelligence

May 29, 2019
Michael K. Cohen, Badri Vellambi, Marcus Hutter

Conditions on Features for Temporal Difference-Like Methods to Converge

May 28, 2019
Marcus Hutter, Samuel Yang-Zhao, Sultan J. Majeed

Strong Asymptotic Optimality in General Environments

Mar 04, 2019
Michael K. Cohen, Elliot Catt, Marcus Hutter

Performance Guarantees for Homomorphisms Beyond Markov Decision Processes

Nov 09, 2018
Sultan Javed Majeed, Marcus Hutter

AGI Safety Literature Review

May 21, 2018
Tom Everitt, Gary Lea, Marcus Hutter

A Topological Approach to Meta-heuristics: Analytical Results on the BFS vs. DFS Algorithm Selection Problem

Apr 12, 2018
Tom Everitt, Marcus Hutter

Reinforcement Learning with a Corrupted Reward Channel

Aug 19, 2017
Tom Everitt, Victoria Krakovna, Laurent Orseau, Marcus Hutter, Shane Legg

Count-Based Exploration in Feature Space for Reinforcement Learning

Jun 25, 2017
Jarryd Martin, Suraj Narayanan Sasikumar, Tom Everitt, Marcus Hutter
