Picture for Mirco Mutti

Mirco Mutti

Geometric Active Exploration in Markov Decision Processes: the Benefit of Abstraction

Add code
Jul 18, 2024
Viaarxiv icon

The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough

Add code
Jun 18, 2024
Viaarxiv icon

How to Scale Inverse RL to Large State Spaces? A Provably Efficient Approach

Add code
Jun 06, 2024
Viaarxiv icon

Test-Time Regret Minimization in Meta Reinforcement Learning

Add code
Jun 04, 2024
Viaarxiv icon

How to Explore with Belief: State Entropy Maximization in POMDPs

Add code
Jun 04, 2024
Viaarxiv icon

Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms

Add code
Feb 23, 2024
Viaarxiv icon

A Framework for Partially Observed Reward-States in RLHF

Add code
Feb 05, 2024
Viaarxiv icon

Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning

Add code
Oct 11, 2023
Figure 1 for Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Figure 2 for Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Viaarxiv icon

A Tale of Sampling and Estimation in Discounted Reinforcement Learning

Add code
Apr 14, 2023
Viaarxiv icon

Reward-Free Policy Space Compression for Reinforcement Learning

Add code
Feb 22, 2022
Figure 1 for Reward-Free Policy Space Compression for Reinforcement Learning
Figure 2 for Reward-Free Policy Space Compression for Reinforcement Learning
Figure 3 for Reward-Free Policy Space Compression for Reinforcement Learning
Viaarxiv icon