Pierre Ménard

OVGU

Local and adaptive mirror descents in extensive-form games

Sep 01, 2023
Côme Fiegel, Pierre Ménard, Tadashi Kozuno, Rémi Munos, Vianney Perchet, Michal Valko

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

May 22, 2023
Toshinori Kitamura, Tadashi Kozuno, Yunhao Tang, Nino Vieillard, Michal Valko, Wenhao Yang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári, Wataru Kumagai, Yutaka Matsuo

Learning Generative Models with Goal-conditioned Reinforcement Learning

Mar 26, 2023
Mariana Vargas Vieyra, Pierre Ménard

Adapting to game trees in zero-sum imperfect information games

Dec 23, 2022
Côme Fiegel, Pierre Ménard, Tadashi Kozuno, Rémi Munos, Vianney Perchet, Michal Valko

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal

May 27, 2022
Tadashi Kozuno, Wenhao Yang, Nino Vieillard, Toshinori Kitamura, Yunhao Tang, Jincheng Mei, Pierre Ménard, Mohammad Gheshlaghi Azar, Michal Valko, Rémi Munos, Olivier Pietquin, Matthieu Geist, Csaba Szepesvári

Indexed Minimum Empirical Divergence for Unimodal Bandits

Dec 02, 2021
Hassan Saber, Pierre Ménard, Odalric-Ambrym Maillard

Adaptive Multi-Goal Exploration

Nov 23, 2021
Jean Tarbouriech, Omar Darwiche Domingues, Pierre Ménard, Matteo Pirotta, Michal Valko, Alessandro Lazaric

Problem Dependent View on Structured Thresholding Bandit Problems

Jun 18, 2021
James Cheshire, Pierre Ménard, Alexandra Carpentier

Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall

Jun 11, 2021
Tadashi Kozuno, Pierre Ménard, Rémi Munos, Michal Valko

Bandits with many optimal arms

Mar 23, 2021
Rianne de Heide, James Cheshire, Pierre Ménard, Alexandra Carpentier
