Alert button
Picture for Pierre Clavier

Pierre Clavier

Alert button

VITS : Variational Inference Thomson Sampling for contextual bandits

Add code
Bookmark button
Alert button
Jul 19, 2023
Pierre Clavier, Tom Huix, Alain Durmus

Viaarxiv icon

Towards Minimax Optimality of Model-based Robust Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 10, 2023
Pierre Clavier, Erwan Le Pennec, Matthieu Geist

Figure 1 for Towards Minimax Optimality of Model-based Robust Reinforcement Learning
Viaarxiv icon

Robust Reinforcement Learning with Distributional Risk-averse formulation

Add code
Bookmark button
Alert button
Jun 14, 2022
Pierre Clavier, Stéphanie Allassonière, Erwan Le Pennec

Figure 1 for Robust Reinforcement Learning with Distributional Risk-averse formulation
Figure 2 for Robust Reinforcement Learning with Distributional Risk-averse formulation
Figure 3 for Robust Reinforcement Learning with Distributional Risk-averse formulation
Figure 4 for Robust Reinforcement Learning with Distributional Risk-averse formulation
Viaarxiv icon