Picture for Pierre Ménard

Pierre Ménard

OVGU

Local and adaptive mirror descents in extensive-form games

Add code
Sep 01, 2023
Figure 1 for Local and adaptive mirror descents in extensive-form games
Viaarxiv icon

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

Add code
May 22, 2023
Figure 1 for Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Figure 2 for Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Figure 3 for Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Figure 4 for Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice
Viaarxiv icon

Learning Generative Models with Goal-conditioned Reinforcement Learning

Add code
Mar 26, 2023
Figure 1 for Learning Generative Models with Goal-conditioned Reinforcement Learning
Figure 2 for Learning Generative Models with Goal-conditioned Reinforcement Learning
Figure 3 for Learning Generative Models with Goal-conditioned Reinforcement Learning
Figure 4 for Learning Generative Models with Goal-conditioned Reinforcement Learning
Viaarxiv icon

Adapting to game trees in zero-sum imperfect information games

Add code
Dec 23, 2022
Figure 1 for Adapting to game trees in zero-sum imperfect information games
Figure 2 for Adapting to game trees in zero-sum imperfect information games
Figure 3 for Adapting to game trees in zero-sum imperfect information games
Figure 4 for Adapting to game trees in zero-sum imperfect information games
Viaarxiv icon

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal

Add code
May 27, 2022
Figure 1 for KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
Figure 2 for KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
Figure 3 for KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
Figure 4 for KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal
Viaarxiv icon

Indexed Minimum Empirical Divergence for Unimodal Bandits

Add code
Dec 02, 2021
Figure 1 for Indexed Minimum Empirical Divergence for Unimodal Bandits
Viaarxiv icon

Adaptive Multi-Goal Exploration

Add code
Nov 23, 2021
Figure 1 for Adaptive Multi-Goal Exploration
Figure 2 for Adaptive Multi-Goal Exploration
Figure 3 for Adaptive Multi-Goal Exploration
Viaarxiv icon

Problem Dependent View on Structured Thresholding Bandit Problems

Add code
Jun 18, 2021
Figure 1 for Problem Dependent View on Structured Thresholding Bandit Problems
Figure 2 for Problem Dependent View on Structured Thresholding Bandit Problems
Figure 3 for Problem Dependent View on Structured Thresholding Bandit Problems
Figure 4 for Problem Dependent View on Structured Thresholding Bandit Problems
Viaarxiv icon

Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall

Add code
Jun 11, 2021
Figure 1 for Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall
Viaarxiv icon

Bandits with many optimal arms

Add code
Mar 23, 2021
Figure 1 for Bandits with many optimal arms
Figure 2 for Bandits with many optimal arms
Figure 3 for Bandits with many optimal arms
Viaarxiv icon