Picture for Mirco Mutti

Mirco Mutti

The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough

Add code
Jun 18, 2024
Figure 1 for The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
Figure 2 for The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
Figure 3 for The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
Figure 4 for The Limits of Pure Exploration in POMDPs: When the Observation Entropy is Enough
Viaarxiv icon

How to Scale Inverse RL to Large State Spaces? A Provably Efficient Approach

Add code
Jun 06, 2024
Viaarxiv icon

How to Explore with Belief: State Entropy Maximization in POMDPs

Add code
Jun 04, 2024
Viaarxiv icon

Test-Time Regret Minimization in Meta Reinforcement Learning

Add code
Jun 04, 2024
Figure 1 for Test-Time Regret Minimization in Meta Reinforcement Learning
Figure 2 for Test-Time Regret Minimization in Meta Reinforcement Learning
Figure 3 for Test-Time Regret Minimization in Meta Reinforcement Learning
Viaarxiv icon

Offline Inverse RL: New Solution Concepts and Provably Efficient Algorithms

Add code
Feb 23, 2024
Viaarxiv icon

A Framework for Partially Observed Reward-States in RLHF

Add code
Feb 05, 2024
Viaarxiv icon

Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning

Add code
Oct 11, 2023
Figure 1 for Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Figure 2 for Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
Viaarxiv icon

A Tale of Sampling and Estimation in Discounted Reinforcement Learning

Add code
Apr 14, 2023
Figure 1 for A Tale of Sampling and Estimation in Discounted Reinforcement Learning
Figure 2 for A Tale of Sampling and Estimation in Discounted Reinforcement Learning
Figure 3 for A Tale of Sampling and Estimation in Discounted Reinforcement Learning
Viaarxiv icon

Reward-Free Policy Space Compression for Reinforcement Learning

Add code
Feb 22, 2022
Figure 1 for Reward-Free Policy Space Compression for Reinforcement Learning
Figure 2 for Reward-Free Policy Space Compression for Reinforcement Learning
Figure 3 for Reward-Free Policy Space Compression for Reinforcement Learning
Viaarxiv icon

Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization

Add code
Feb 14, 2022
Figure 1 for Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization
Figure 2 for Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization
Figure 3 for Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization
Figure 4 for Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization
Viaarxiv icon