Picture for Matthijs T. J. Spaan

Matthijs T. J. Spaan

Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration

Add code
Mar 25, 2026
Viaarxiv icon

On the Equivalence of Random Network Distillation, Deep Ensembles, and Bayesian Inference

Add code
Feb 26, 2026
Viaarxiv icon

Sparse Masked Attention Policies for Reliable Generalization

Add code
Feb 23, 2026
Viaarxiv icon

Parallelizing Tree Search with Twice Sequential Monte Carlo

Add code
Nov 18, 2025
Viaarxiv icon

Universal Value-Function Uncertainties

Add code
May 27, 2025
Figure 1 for Universal Value-Function Uncertainties
Figure 2 for Universal Value-Function Uncertainties
Figure 3 for Universal Value-Function Uncertainties
Figure 4 for Universal Value-Function Uncertainties
Viaarxiv icon

Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks

Add code
May 24, 2025
Figure 1 for Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks
Figure 2 for Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks
Figure 3 for Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks
Figure 4 for Bayesian Meta-Reinforcement Learning with Laplace Variational Recurrent Networks
Viaarxiv icon

How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning

Add code
May 22, 2025
Figure 1 for How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
Figure 2 for How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
Figure 3 for How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
Figure 4 for How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning
Viaarxiv icon

VeRecycle: Reclaiming Guarantees from Probabilistic Certificates for Stochastic Dynamical Systems after Change

Add code
May 20, 2025
Figure 1 for VeRecycle: Reclaiming Guarantees from Probabilistic Certificates for Stochastic Dynamical Systems after Change
Figure 2 for VeRecycle: Reclaiming Guarantees from Probabilistic Certificates for Stochastic Dynamical Systems after Change
Figure 3 for VeRecycle: Reclaiming Guarantees from Probabilistic Certificates for Stochastic Dynamical Systems after Change
Figure 4 for VeRecycle: Reclaiming Guarantees from Probabilistic Certificates for Stochastic Dynamical Systems after Change
Viaarxiv icon

Trust-Region Twisted Policy Improvement

Add code
Apr 08, 2025
Figure 1 for Trust-Region Twisted Policy Improvement
Figure 2 for Trust-Region Twisted Policy Improvement
Figure 3 for Trust-Region Twisted Policy Improvement
Figure 4 for Trust-Region Twisted Policy Improvement
Viaarxiv icon

Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model

Add code
Mar 14, 2025
Figure 1 for Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Figure 2 for Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Figure 3 for Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Figure 4 for Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model
Viaarxiv icon