Picture for Michael Dennis

Michael Dennis

Multi-Agent Risks from Advanced AI

Add code
Feb 19, 2025
Viaarxiv icon

BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping

Add code
Sep 09, 2024
Figure 1 for BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping
Figure 2 for BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping
Figure 3 for BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping
Figure 4 for BAMDP Shaping: a Unified Theoretical Framework for Intrinsic Motivation and Reward Shaping
Viaarxiv icon

The Benefits of Power Regularization in Cooperative Reinforcement Learning

Add code
Jun 17, 2024
Viaarxiv icon

Open-Endedness is Essential for Artificial Superhuman Intelligence

Add code
Jun 06, 2024
Figure 1 for Open-Endedness is Essential for Artificial Superhuman Intelligence
Figure 2 for Open-Endedness is Essential for Artificial Superhuman Intelligence
Figure 3 for Open-Endedness is Essential for Artificial Superhuman Intelligence
Viaarxiv icon

Genie: Generative Interactive Environments

Add code
Feb 23, 2024
Figure 1 for Genie: Generative Interactive Environments
Figure 2 for Genie: Generative Interactive Environments
Figure 3 for Genie: Generative Interactive Environments
Figure 4 for Genie: Generative Interactive Environments
Viaarxiv icon

Refining Minimax Regret for Unsupervised Environment Design

Add code
Feb 19, 2024
Viaarxiv icon

minimax: Efficient Baselines for Autocurricula in JAX

Add code
Nov 23, 2023
Figure 1 for minimax: Efficient Baselines for Autocurricula in JAX
Figure 2 for minimax: Efficient Baselines for Autocurricula in JAX
Figure 3 for minimax: Efficient Baselines for Autocurricula in JAX
Figure 4 for minimax: Efficient Baselines for Autocurricula in JAX
Viaarxiv icon

Stabilizing Unsupervised Environment Design with a Learned Adversary

Add code
Aug 22, 2023
Figure 1 for Stabilizing Unsupervised Environment Design with a Learned Adversary
Figure 2 for Stabilizing Unsupervised Environment Design with a Learned Adversary
Figure 3 for Stabilizing Unsupervised Environment Design with a Learned Adversary
Figure 4 for Stabilizing Unsupervised Environment Design with a Learned Adversary
Viaarxiv icon

Who Needs to Know? Minimal Knowledge for Optimal Coordination

Add code
Jun 15, 2023
Figure 1 for Who Needs to Know? Minimal Knowledge for Optimal Coordination
Figure 2 for Who Needs to Know? Minimal Knowledge for Optimal Coordination
Figure 3 for Who Needs to Know? Minimal Knowledge for Optimal Coordination
Figure 4 for Who Needs to Know? Minimal Knowledge for Optimal Coordination
Viaarxiv icon

MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning

Add code
Mar 06, 2023
Viaarxiv icon