Matteo Pirotta

Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees

Oct 24, 2022
Via arXiv

Contextual bandits with concave rewards, and an application to fair ranking

Oct 18, 2022
Via arXiv

Reaching Goals is Hard: Settling the Sample Complexity of the Stochastic Shortest Path

Oct 10, 2022
Via arXiv

Top $K$ Ranking for Multi-Armed Bandit with Noisy Evaluations

Dec 14, 2021
Via arXiv

Privacy Amplification via Shuffling for Linear Contextual Bandits

Dec 11, 2021
Via arXiv

Differentially Private Exploration in Reinforcement Learning with Linear Representation

Dec 07, 2021
Via arXiv

Adaptive Multi-Goal Exploration

Nov 23, 2021
Via arXiv

Reinforcement Learning in Linear MDPs: Constant Regret and Representation Selection

Oct 27, 2021
Via arXiv

A Fully Problem-Dependent Regret Lower Bound for Finite-Horizon MDPs

Jun 24, 2021
Via arXiv

A Unified Framework for Conservative Exploration

Jun 22, 2021
Via arXiv