Alert button
Picture for Frans A. Oliehoek

Frans A. Oliehoek

Alert button

Policy Space Response Oracles: A Survey

Add code
Bookmark button
Alert button
Mar 04, 2024
Ariyan Bighashdel, Yongzhao Wang, Stephen McAleer, Rahul Savani, Frans A. Oliehoek

Figure 1 for Policy Space Response Oracles: A Survey
Figure 2 for Policy Space Response Oracles: A Survey
Viaarxiv icon

When Do Off-Policy and On-Policy Policy Gradient Methods Align?

Add code
Bookmark button
Alert button
Feb 19, 2024
Davide Mambelli, Stephan Bongers, Onno Zoeter, Matthijs T. J. Spaan, Frans A. Oliehoek

Viaarxiv icon

What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization

Add code
Bookmark button
Alert button
Nov 19, 2023
Zuzanna Osika, Jazmin Zatarain Salazar, Diederik M. Roijers, Frans A. Oliehoek, Pradeep K. Murukannaiah

Viaarxiv icon

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL

Add code
Bookmark button
Alert button
Jun 04, 2023
Miguel Suau, Matthijs T. J. Spaan, Frans A. Oliehoek

Figure 1 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 2 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 3 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 4 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Viaarxiv icon

What model does MuZero learn?

Add code
Bookmark button
Alert button
Jun 01, 2023
Jinke He, Thomas M. Moerland, Frans A. Oliehoek

Figure 1 for What model does MuZero learn?
Figure 2 for What model does MuZero learn?
Figure 3 for What model does MuZero learn?
Figure 4 for What model does MuZero learn?
Viaarxiv icon

Towards a Unifying Model of Rationality in Multiagent Systems

Add code
Bookmark button
Alert button
May 29, 2023
Robert Loftin, Mustafa Mert Çelikok, Frans A. Oliehoek

Viaarxiv icon

Safety Guarantees in Multi-agent Learning via Trapping Regions

Add code
Bookmark button
Alert button
Feb 27, 2023
Aleksander Czechowski, Frans A. Oliehoek

Figure 1 for Safety Guarantees in Multi-agent Learning via Trapping Regions
Figure 2 for Safety Guarantees in Multi-agent Learning via Trapping Regions
Figure 3 for Safety Guarantees in Multi-agent Learning via Trapping Regions
Figure 4 for Safety Guarantees in Multi-agent Learning via Trapping Regions
Viaarxiv icon

Uncoupled Learning of Differential Stackelberg Equilibria with Commitments

Add code
Bookmark button
Alert button
Feb 07, 2023
Robert Loftin, Mustafa Mert Çelikok, Herke van Hoof, Samuel Kaski, Frans A. Oliehoek

Viaarxiv icon

An Analysis of Abstracted Model-Based Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 30, 2022
Rolf A. N. Starre, Marco Loog, Frans A. Oliehoek

Viaarxiv icon

Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems

Add code
Bookmark button
Alert button
Jul 01, 2022
Miguel Suau, Jinke He, Mustafa Mert Çelikok, Matthijs T. J. Spaan, Frans A. Oliehoek

Figure 1 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 2 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 3 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 4 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Viaarxiv icon