Picture for Frans A. Oliehoek

Frans A. Oliehoek

Communicating with Speakers and Listeners of Different Pragmatic Levels

Add code
Oct 08, 2024
Viaarxiv icon

Online Planning in POMDPs with State-Requests

Add code
Jul 26, 2024
Viaarxiv icon

Inverse Concave-Utility Reinforcement Learning is Inverse Game Theory

Add code
May 29, 2024
Viaarxiv icon

Policy Space Response Oracles: A Survey

Add code
Mar 04, 2024
Viaarxiv icon

When Do Off-Policy and On-Policy Policy Gradient Methods Align?

Add code
Feb 19, 2024
Viaarxiv icon

What Lies beyond the Pareto Front? A Survey on Decision-Support Methods for Multi-Objective Optimization

Add code
Nov 19, 2023
Viaarxiv icon

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL

Add code
Jun 04, 2023
Viaarxiv icon

What model does MuZero learn?

Add code
Jun 01, 2023
Viaarxiv icon

Towards a Unifying Model of Rationality in Multiagent Systems

Add code
May 29, 2023
Viaarxiv icon

Safety Guarantees in Multi-agent Learning via Trapping Regions

Add code
Feb 27, 2023
Viaarxiv icon