Picture for Marcello Restelli

Marcello Restelli

From Parameters to Behavior: Unsupervised Compression of the Policy Space

Add code
Sep 26, 2025
Viaarxiv icon

Limitations of Physics-Informed Neural Networks: a Study on Smart Grid Surrogation

Add code
Aug 29, 2025
Viaarxiv icon

"So, Tell Me About Your Policy...": Distillation of interpretable policies from Deep Reinforcement Learning agents

Add code
Jul 10, 2025
Viaarxiv icon

Enhancing Diversity in Parallel Agents: A Maximum State Entropy Exploration Story

Add code
May 02, 2025
Viaarxiv icon

Towards Principled Multi-Agent Task Agnostic Exploration

Add code
Feb 12, 2025
Viaarxiv icon

Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models

Add code
Jan 30, 2025
Figure 1 for Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models
Figure 2 for Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models
Figure 3 for Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models
Figure 4 for Achieving $\widetilde{\mathcal{O}}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models
Viaarxiv icon

A parametric algorithm is optimal for non-parametric regression of smooth functions

Add code
Dec 19, 2024
Figure 1 for A parametric algorithm is optimal for non-parametric regression of smooth functions
Figure 2 for A parametric algorithm is optimal for non-parametric regression of smooth functions
Figure 3 for A parametric algorithm is optimal for non-parametric regression of smooth functions
Figure 4 for A parametric algorithm is optimal for non-parametric regression of smooth functions
Viaarxiv icon

Statistical Analysis of Policy Space Compression Problem

Add code
Nov 15, 2024
Viaarxiv icon

A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics

Add code
Nov 08, 2024
Viaarxiv icon

Local Linearity: the Key for No-regret Reinforcement Learning in Continuous MDPs

Add code
Oct 31, 2024
Viaarxiv icon