Picture for Frans A. Oliehoek

Frans A. Oliehoek

Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory

Add code
May 29, 2024
Figure 1 for Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory
Figure 2 for Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory
Viaarxiv icon

Policy Space Response Oracles: A Survey

Add code
Mar 04, 2024
Figure 1 for Policy Space Response Oracles: A Survey
Figure 2 for Policy Space Response Oracles: A Survey
Viaarxiv icon

When Do Off-Policy and On-Policy Policy Gradient Methods Align?

Add code
Feb 19, 2024
Figure 1 for When Do Off-Policy and On-Policy Policy Gradient Methods Align?
Figure 2 for When Do Off-Policy and On-Policy Policy Gradient Methods Align?
Figure 3 for When Do Off-Policy and On-Policy Policy Gradient Methods Align?
Figure 4 for When Do Off-Policy and On-Policy Policy Gradient Methods Align?
Viaarxiv icon

What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization

Add code
Nov 19, 2023
Figure 1 for What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization
Figure 2 for What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization
Figure 3 for What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization
Figure 4 for What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization
Viaarxiv icon

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL

Add code
Jun 04, 2023
Figure 1 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 2 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 3 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 4 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Viaarxiv icon

What model does MuZero learn?

Add code
Jun 01, 2023
Figure 1 for What model does MuZero learn?
Figure 2 for What model does MuZero learn?
Figure 3 for What model does MuZero learn?
Figure 4 for What model does MuZero learn?
Viaarxiv icon

Towards a Unifying Model of Rationality in Multiagent Systems

Add code
May 29, 2023
Viaarxiv icon

Safety Guarantees in Multi-agent Learning via Trapping Regions

Add code
Feb 27, 2023
Viaarxiv icon

Uncoupled Learning of Differential Stackelberg Equilibria with Commitments

Add code
Feb 07, 2023
Viaarxiv icon

An Analysis of Abstracted Model-Based Reinforcement Learning

Add code
Aug 30, 2022
Viaarxiv icon