Matteo Pirotta

Top $K$ Ranking for Multi-Armed Bandit with Noisy Evaluations

Dec 14, 2021
Via arXiv

Privacy Amplification via Shuffling for Linear Contextual Bandits

Dec 11, 2021
Via arXiv

Differentially Private Exploration in Reinforcement Learning with Linear Representation

Dec 07, 2021
Via arXiv

Adaptive Multi-Goal Exploration

Nov 23, 2021
Via arXiv

Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

Oct 27, 2021
Via arXiv

A Fully Problem-Dependent Regret Lower Bound for Finite-Horizon MDPs

Jun 24, 2021
Via arXiv

A Unified Framework for Conservative Exploration

Jun 22, 2021
Via arXiv

Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret

Apr 22, 2021
Via arXiv

Leveraging Good Representations in Linear Contextual Bandits

Apr 08, 2021
Via arXiv

Homomorphically Encrypted Linear Contextual Bandit

Mar 17, 2021
Via arXiv