Alert button
Picture for Alessandro Lazaric

Alessandro Lazaric

Alert button

Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation

Add code
Bookmark button
Alert button
Jul 13, 2020
Marc Abeille, Alessandro Lazaric

Figure 1 for Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation
Figure 2 for Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation
Figure 3 for Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation
Figure 4 for Efficient Optimistic Exploration in Linear-Quadratic Regulators via Lagrangian Relaxation
Viaarxiv icon

A Provably Efficient Sample Collection Strategy for Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 13, 2020
Jean Tarbouriech, Matteo Pirotta, Michal Valko, Alessandro Lazaric

Figure 1 for A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Figure 2 for A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Figure 3 for A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Figure 4 for A Provably Efficient Sample Collection Strategy for Reinforcement Learning
Viaarxiv icon

Improved Analysis of UCRL2 with Empirical Bernstein Inequality

Add code
Bookmark button
Alert button
Jul 10, 2020
Ronan Fruit, Matteo Pirotta, Alessandro Lazaric

Figure 1 for Improved Analysis of UCRL2 with Empirical Bernstein Inequality
Viaarxiv icon

A Novel Confidence-Based Algorithm for Structured Bandits

Add code
Bookmark button
Alert button
May 23, 2020
Andrea Tirinzoni, Alessandro Lazaric, Marcello Restelli

Figure 1 for A Novel Confidence-Based Algorithm for Structured Bandits
Figure 2 for A Novel Confidence-Based Algorithm for Structured Bandits
Figure 3 for A Novel Confidence-Based Algorithm for Structured Bandits
Figure 4 for A Novel Confidence-Based Algorithm for Structured Bandits
Viaarxiv icon

Meta-learning with Stochastic Linear Bandits

Add code
Bookmark button
Alert button
May 18, 2020
Leonardo Cella, Alessandro Lazaric, Massimiliano Pontil

Figure 1 for Meta-learning with Stochastic Linear Bandits
Figure 2 for Meta-learning with Stochastic Linear Bandits
Figure 3 for Meta-learning with Stochastic Linear Bandits
Figure 4 for Meta-learning with Stochastic Linear Bandits
Viaarxiv icon

Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization

Add code
Bookmark button
Alert button
May 06, 2020
Pierre-Alexandre Kamienny, Matteo Pirotta, Alessandro Lazaric, Thibault Lavril, Nicolas Usunier, Ludovic Denoyer

Figure 1 for Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Figure 2 for Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Figure 3 for Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Figure 4 for Learning Adaptive Exploration Strategies in Dynamic Environments Through Informed Policy Regularization
Viaarxiv icon

Active Model Estimation in Markov Decision Processes

Add code
Bookmark button
Alert button
Mar 06, 2020
Jean Tarbouriech, Shubhanshu Shekhar, Matteo Pirotta, Mohammad Ghavamzadeh, Alessandro Lazaric

Figure 1 for Active Model Estimation in Markov Decision Processes
Figure 2 for Active Model Estimation in Markov Decision Processes
Figure 3 for Active Model Estimation in Markov Decision Processes
Figure 4 for Active Model Estimation in Markov Decision Processes
Viaarxiv icon

Learning Near Optimal Policies with Low Inherent Bellman Error

Add code
Bookmark button
Alert button
Mar 05, 2020
Andrea Zanette, Alessandro Lazaric, Mykel Kochenderfer, Emma Brunskill

Figure 1 for Learning Near Optimal Policies with Low Inherent Bellman Error
Viaarxiv icon

Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification

Add code
Bookmark button
Alert button
Feb 26, 2020
Daniele Calandriello, Luigi Carratino, Alessandro Lazaric, Michal Valko, Lorenzo Rosasco

Figure 1 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Figure 2 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Figure 3 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Figure 4 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Viaarxiv icon