Picture for Bruno Lacerda

Bruno Lacerda

A Finite-State Controller Based Offline Solver for Deterministic POMDPs

Add code
May 01, 2025
Viaarxiv icon

Return Capping: Sample-Efficient CVaR Policy Gradient Optimisation

Add code
Apr 29, 2025
Viaarxiv icon

No Regrets: Investigating and Improving Regret Approximations for Curriculum Discovery

Add code
Aug 27, 2024
Viaarxiv icon

Monte Carlo Tree Search with Boltzmann Exploration

Add code
Apr 11, 2024
Viaarxiv icon

JaxMARL: Multi-Agent RL Environments in JAX

Add code
Nov 20, 2023
Viaarxiv icon

A Framework for Learning from Demonstration with Minimal Human Effort

Add code
Jun 15, 2023
Viaarxiv icon

Formal Modelling for Multi-Robot Systems Under Uncertainty

Add code
May 26, 2023
Viaarxiv icon

One Risk to Rule Them All: A Risk-Sensitive Perspective on Model-Based Offline Reinforcement Learning

Add code
Nov 30, 2022
Viaarxiv icon

RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning

Add code
Apr 26, 2022
Figure 1 for RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning
Figure 2 for RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning
Figure 3 for RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning
Viaarxiv icon

Lexicographic Optimisation of Conditional Value at Risk and Expected Value for Risk-Averse Planning in MDPs

Add code
Oct 25, 2021
Figure 1 for Lexicographic Optimisation of Conditional Value at Risk and Expected Value for Risk-Averse Planning in MDPs
Figure 2 for Lexicographic Optimisation of Conditional Value at Risk and Expected Value for Risk-Averse Planning in MDPs
Figure 3 for Lexicographic Optimisation of Conditional Value at Risk and Expected Value for Risk-Averse Planning in MDPs
Figure 4 for Lexicographic Optimisation of Conditional Value at Risk and Expected Value for Risk-Averse Planning in MDPs
Viaarxiv icon