Alert button
Picture for Odalric-Ambrym Maillard

Odalric-Ambrym Maillard

Alert button

CRIStAL

CRIMED: Lower and Upper Bounds on Regret for Bandits with Unbounded Stochastic Corruption

Add code
Bookmark button
Alert button
Sep 28, 2023
Shubhada Agrawal, Timothée Mathieu, Debabrota Basu, Odalric-Ambrym Maillard

Viaarxiv icon

Monte-Carlo tree search with uncertainty propagation via optimal transport

Add code
Bookmark button
Alert button
Sep 19, 2023
Tuan Dam, Pascal Stenger, Lukas Schneider, Joni Pajarinen, Carlo D'Eramo, Odalric-Ambrym Maillard

Figure 1 for Monte-Carlo tree search with uncertainty propagation via optimal transport
Figure 2 for Monte-Carlo tree search with uncertainty propagation via optimal transport
Figure 3 for Monte-Carlo tree search with uncertainty propagation via optimal transport
Figure 4 for Monte-Carlo tree search with uncertainty propagation via optimal transport
Viaarxiv icon

AdaStop: sequential testing for efficient and reliable comparisons of Deep RL Agents

Add code
Bookmark button
Alert button
Jun 19, 2023
Timothée Mathieu, Riccardo Della Vecchia, Alena Shilova, Matheus Centa de Medeiros, Hector Kohler, Odalric-Ambrym Maillard, Philippe Preux

Figure 1 for AdaStop: sequential testing for efficient and reliable comparisons of Deep RL Agents
Figure 2 for AdaStop: sequential testing for efficient and reliable comparisons of Deep RL Agents
Figure 3 for AdaStop: sequential testing for efficient and reliable comparisons of Deep RL Agents
Figure 4 for AdaStop: sequential testing for efficient and reliable comparisons of Deep RL Agents
Viaarxiv icon

Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning

Add code
Bookmark button
Alert button
Oct 05, 2022
Reda Ouhamma, Debabrota Basu, Odalric-Ambrym Maillard

Figure 1 for Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning
Figure 2 for Bilinear Exponential Family of MDPs: Frequentist Regret Bound with Tractable Exploration and Planning
Viaarxiv icon

Risk-aware linear bandits with convex loss

Add code
Bookmark button
Alert button
Sep 15, 2022
Patrick Saux, Odalric-Ambrym Maillard

Figure 1 for Risk-aware linear bandits with convex loss
Figure 2 for Risk-aware linear bandits with convex loss
Figure 3 for Risk-aware linear bandits with convex loss
Figure 4 for Risk-aware linear bandits with convex loss
Viaarxiv icon

Collaborative Algorithms for Online Personalized Mean Estimation

Add code
Bookmark button
Alert button
Aug 24, 2022
Mahsa Asadi, Aurélien Bellet, Odalric-Ambrym Maillard, Marc Tommasi

Figure 1 for Collaborative Algorithms for Online Personalized Mean Estimation
Figure 2 for Collaborative Algorithms for Online Personalized Mean Estimation
Figure 3 for Collaborative Algorithms for Online Personalized Mean Estimation
Figure 4 for Collaborative Algorithms for Online Personalized Mean Estimation
Viaarxiv icon

Bandits Corrupted by Nature: Lower Bounds on Regret and Robust Optimistic Algorithm

Add code
Bookmark button
Alert button
Mar 07, 2022
Debabrota Basu, Odalric-Ambrym Maillard, Timothée Mathieu

Figure 1 for Bandits Corrupted by Nature: Lower Bounds on Regret and Robust Optimistic Algorithm
Figure 2 for Bandits Corrupted by Nature: Lower Bounds on Regret and Robust Optimistic Algorithm
Figure 3 for Bandits Corrupted by Nature: Lower Bounds on Regret and Robust Optimistic Algorithm
Viaarxiv icon

Bregman Deviations of Generic Exponential Families

Add code
Bookmark button
Alert button
Jan 18, 2022
Sayak Ray Chowdhury, Patrick Saux, Odalric-Ambrym Maillard, Aditya Gopalan

Figure 1 for Bregman Deviations of Generic Exponential Families
Figure 2 for Bregman Deviations of Generic Exponential Families
Figure 3 for Bregman Deviations of Generic Exponential Families
Figure 4 for Bregman Deviations of Generic Exponential Families
Viaarxiv icon

Indexed Minimum Empirical Divergence for Unimodal Bandits

Add code
Bookmark button
Alert button
Dec 02, 2021
Hassan Saber, Pierre Ménard, Odalric-Ambrym Maillard

Figure 1 for Indexed Minimum Empirical Divergence for Unimodal Bandits
Viaarxiv icon

From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits

Add code
Bookmark button
Alert button
Nov 18, 2021
Dorian Baudry, Patrick Saux, Odalric-Ambrym Maillard

Figure 1 for From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits
Figure 2 for From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits
Figure 3 for From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits
Figure 4 for From Optimality to Robustness: Dirichlet Sampling Strategies in Stochastic Bandits
Viaarxiv icon