Picture for Matthijs T. J. Spaan

Matthijs T. J. Spaan

Pessimistic Iterative Planning for Robust POMDPs

Add code
Aug 16, 2024
Viaarxiv icon

Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning

Add code
Jun 12, 2024
Viaarxiv icon

Value Improved Actor Critic Algorithms

Add code
Jun 03, 2024
Viaarxiv icon

Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications

Add code
Apr 02, 2024
Figure 1 for Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications
Figure 2 for Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications
Figure 3 for Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications
Figure 4 for Exploring LLMs as a Source of Targeted Synthetic Textual Data to Minimize High Confidence Misclassifications
Viaarxiv icon

When Do Off-Policy and On-Policy Policy Gradient Methods Align?

Add code
Feb 19, 2024
Viaarxiv icon

Reinforcement Learning by Guided Safe Exploration

Add code
Jul 26, 2023
Figure 1 for Reinforcement Learning by Guided Safe Exploration
Figure 2 for Reinforcement Learning by Guided Safe Exploration
Figure 3 for Reinforcement Learning by Guided Safe Exploration
Figure 4 for Reinforcement Learning by Guided Safe Exploration
Viaarxiv icon

Diverse Projection Ensembles for Distributional Reinforcement Learning

Add code
Jun 12, 2023
Figure 1 for Diverse Projection Ensembles for Distributional Reinforcement Learning
Figure 2 for Diverse Projection Ensembles for Distributional Reinforcement Learning
Figure 3 for Diverse Projection Ensembles for Distributional Reinforcement Learning
Figure 4 for Diverse Projection Ensembles for Distributional Reinforcement Learning
Viaarxiv icon

The Role of Diverse Replay for Generalisation in Reinforcement Learning

Add code
Jun 09, 2023
Figure 1 for The Role of Diverse Replay for Generalisation in Reinforcement Learning
Figure 2 for The Role of Diverse Replay for Generalisation in Reinforcement Learning
Figure 3 for The Role of Diverse Replay for Generalisation in Reinforcement Learning
Figure 4 for The Role of Diverse Replay for Generalisation in Reinforcement Learning
Viaarxiv icon

Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL

Add code
Jun 04, 2023
Figure 1 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 2 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 3 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Figure 4 for Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in RL
Viaarxiv icon

Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems

Add code
Jul 01, 2022
Figure 1 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 2 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 3 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Figure 4 for Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
Viaarxiv icon