Alessandro Montenegro

Reusing Trajectories in Policy Gradients Enables Fast Convergence

Jun 06, 2025
Via arXiv

Learning Deterministic Policies with Policy Gradients in Constrained Markov Decision Processes

Jun 06, 2025
Via arXiv

Last-Iterate Global Convergence of Policy Gradients for Constrained Reinforcement Learning

Jul 15, 2024
Via arXiv

Learning Optimal Deterministic Policies with Stochastic Policy Gradients

May 03, 2024
Via arXiv

Best Arm Identification for Stochastic Rising Bandits

Feb 15, 2023
Via arXiv