Picture for Alessandro Abate

Alessandro Abate

University of Oxford

Walking the Values in Bayesian Inverse Reinforcement Learning

Add code
Jul 15, 2024
Viaarxiv icon

The Perils of Optimizing Learned Reward Functions: Low Training Error Does Not Guarantee Low Regret

Add code
Jun 22, 2024
Viaarxiv icon

Deep Bayesian Active Learning for Preference Modeling in Large Language Models

Add code
Jun 14, 2024
Figure 1 for Deep Bayesian Active Learning for Preference Modeling in Large Language Models
Figure 2 for Deep Bayesian Active Learning for Preference Modeling in Large Language Models
Figure 3 for Deep Bayesian Active Learning for Preference Modeling in Large Language Models
Figure 4 for Deep Bayesian Active Learning for Preference Modeling in Large Language Models
Viaarxiv icon

Bisimulation Learning

Add code
May 24, 2024
Viaarxiv icon

Towards Guaranteed Safe AI: A Framework for Ensuring Robust and Reliable AI Systems

Add code
May 10, 2024
Viaarxiv icon

Safe Reach Set Computation via Neural Barrier Certificates

Add code
Apr 29, 2024
Figure 1 for Safe Reach Set Computation via Neural Barrier Certificates
Figure 2 for Safe Reach Set Computation via Neural Barrier Certificates
Figure 3 for Safe Reach Set Computation via Neural Barrier Certificates
Figure 4 for Safe Reach Set Computation via Neural Barrier Certificates
Viaarxiv icon

Quantifying the Sensitivity of Inverse Reinforcement Learning to Misspecification

Add code
Mar 11, 2024
Viaarxiv icon

Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers

Add code
Jan 29, 2024
Figure 1 for Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers
Figure 2 for Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers
Figure 3 for Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers
Figure 4 for Distributed Markov Chain Monte Carlo Sampling based on the Alternating Direction Method of Multipliers
Viaarxiv icon

On the Limitations of Markovian Rewards to Express Multi-Objective, Risk-Sensitive, and Modal Tasks

Add code
Jan 26, 2024
Viaarxiv icon

Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis

Add code
Dec 18, 2023
Figure 1 for Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis
Figure 2 for Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis
Figure 3 for Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis
Figure 4 for Safeguarded Progress in Reinforcement Learning: Safe Bayesian Exploration for Control Policy Synthesis
Viaarxiv icon