Pierre Gaillard

Thoth

Stop Relying on No-Choice and Do not Repeat the Moves: Optimal, Efficient and Practical Algorithms for Assortment Optimization

Feb 29, 2024
Aadirupa Saha, Pierre Gaillard

Covariance-Adaptive Least-Squares Algorithm for Stochastic Combinatorial Semi-Bandits

Feb 23, 2024
Julien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel

Online Learning Approach for Survival Analysis

Feb 07, 2024
Camila Fernandez, Pierre Gaillard, Joseph de Vilmarest, Olivier Wintenberger

Efficient Model-Based Concave Utility Reinforcement Learning through Greedy Mirror Descent

Nov 30, 2023
Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane

Adaptive approximation of monotone functions

Sep 14, 2023
Pierre Gaillard, Sébastien Gerchinovitz, Étienne de Montbrun

Sequential Counterfactual Risk Minimization

Feb 23, 2023
Houssam Zenati, Eustache Diemert, Matthieu Martin, Julien Mairal, Pierre Gaillard

A mirror descent approach for Mean Field Control applied to Demande-Side management

Feb 16, 2023
Bianca Marin Moreno, Margaux Brégère, Pierre Gaillard, Nadia Oudjane

One Arrow, Two Kills: An Unified Framework for Achieving Optimal Regret Guarantees in Sleeping Bandits

Oct 26, 2022
Pierre Gaillard, Aadirupa Saha, Soham Dan

Versatile Dueling Bandits: Best-of-both-World Analyses for Online Learning from Preferences

Feb 14, 2022
Aadirupa Saha, Pierre Gaillard

Efficient Kernel UCB for Contextual Bandits

Feb 11, 2022
Houssam Zenati, Alberto Bietti, Eustache Diemert, Julien Mairal, Matthieu Martin, Pierre Gaillard
