Picture for Wendelin Böhmer

Wendelin Böhmer

Universal Value-Function Uncertainties

Add code
May 27, 2025
Viaarxiv icon

How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning

Add code
May 22, 2025
Viaarxiv icon

Contextual Similarity Distillation: Ensemble Uncertainties with a Single Model

Add code
Mar 14, 2025
Viaarxiv icon

Training on more Reachable Tasks for Generalisation in Reinforcement Learning

Add code
Oct 04, 2024
Figure 1 for Training on more Reachable Tasks for Generalisation in Reinforcement Learning
Figure 2 for Training on more Reachable Tasks for Generalisation in Reinforcement Learning
Figure 3 for Training on more Reachable Tasks for Generalisation in Reinforcement Learning
Figure 4 for Training on more Reachable Tasks for Generalisation in Reinforcement Learning
Viaarxiv icon

Explore-Go: Leveraging Exploration for Generalisation in Deep Reinforcement Learning

Add code
Jun 12, 2024
Viaarxiv icon

A Penalty-Based Guardrail Algorithm for Non-Decreasing Optimization with Inequality Constraints

Add code
May 03, 2024
Figure 1 for A Penalty-Based Guardrail Algorithm for Non-Decreasing Optimization with Inequality Constraints
Figure 2 for A Penalty-Based Guardrail Algorithm for Non-Decreasing Optimization with Inequality Constraints
Figure 3 for A Penalty-Based Guardrail Algorithm for Non-Decreasing Optimization with Inequality Constraints
Figure 4 for A Penalty-Based Guardrail Algorithm for Non-Decreasing Optimization with Inequality Constraints
Viaarxiv icon

To the Max: Reinventing Reward in Reinforcement Learning

Add code
Feb 02, 2024
Figure 1 for To the Max: Reinventing Reward in Reinforcement Learning
Figure 2 for To the Max: Reinventing Reward in Reinforcement Learning
Figure 3 for To the Max: Reinventing Reward in Reinforcement Learning
Figure 4 for To the Max: Reinventing Reward in Reinforcement Learning
Viaarxiv icon

Multi-Robot Local Motion Planning Using Dynamic Optimization Fabrics

Add code
Oct 19, 2023
Viaarxiv icon

You Shall not Pass: the Zero-Gradient Problem in Predict and Optimize for Convex Optimization

Add code
Jul 30, 2023
Viaarxiv icon

Diverse Projection Ensembles for Distributional Reinforcement Learning

Add code
Jun 12, 2023
Figure 1 for Diverse Projection Ensembles for Distributional Reinforcement Learning
Figure 2 for Diverse Projection Ensembles for Distributional Reinforcement Learning
Figure 3 for Diverse Projection Ensembles for Distributional Reinforcement Learning
Figure 4 for Diverse Projection Ensembles for Distributional Reinforcement Learning
Viaarxiv icon