Francesco Belardinelli

Imperial College London

Safe Reinforcement Learning via Recovery-based Shielding with Gaussian Process Dynamics Models

Feb 17, 2026

Expressive Temporal Specifications for Reward Monitoring

Nov 16, 2025

Behaviour Policy Optimization: Provably Lower Variance Return Estimates for Off-Policy Reinforcement Learning

Nov 13, 2025

Multi-Agent Q-Learning Dynamics in Random Networks: Convergence due to Exploration and Sparsity

Mar 13, 2025

Probabilistic Shielding for Safe Reinforcement Learning

Mar 09, 2025

Explainable Reinforcement Learning for Formula One Race Strategy

Jan 07, 2025

Measuring Goal-Directedness

Dec 06, 2024

The Reasons that Agents Act: Intention and Instrumental Goals

Feb 15, 2024

Leveraging Approximate Model-based Shielding for Probabilistic Safety Guarantees in Continuous Environments

Feb 01, 2024

Stability of Multi-Agent Learning in Competitive Networks: Delaying the Onset of Chaos

Dec 19, 2023