Picture for Matteo Papini

Matteo Papini

Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning

Add code
Jul 15, 2024
Viaarxiv icon

Projection by Convolution: Optimal Sample Complexity for Reinforcement Learning in Continuous-Space MDPs

Add code
May 10, 2024
Viaarxiv icon

Policy Gradient with Active Importance Sampling

Add code
May 09, 2024
Viaarxiv icon

Learning Optimal Deterministic Policies with Stochastic Policy Gradients

Add code
May 03, 2024
Viaarxiv icon

Optimisic Information Directed Sampling

Add code
Feb 23, 2024
Viaarxiv icon

No-Regret Reinforcement Learning in Smooth MDPs

Add code
Feb 06, 2024
Viaarxiv icon

Importance-Weighted Offline Learning Done Right

Add code
Sep 27, 2023
Figure 1 for Importance-Weighted Offline Learning Done Right
Figure 2 for Importance-Weighted Offline Learning Done Right
Figure 3 for Importance-Weighted Offline Learning Done Right
Viaarxiv icon

Offline Primal-Dual Reinforcement Learning for Linear MDPs

Add code
May 22, 2023
Figure 1 for Offline Primal-Dual Reinforcement Learning for Linear MDPs
Viaarxiv icon

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

Add code
Oct 24, 2022
Figure 1 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 2 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 3 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Figure 4 for Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees
Viaarxiv icon

Online Learning with Off-Policy Feedback

Add code
Jul 18, 2022
Figure 1 for Online Learning with Off-Policy Feedback
Viaarxiv icon