Claire Vernade

L2S

A Pontryagin Perspective on Reinforcement Learning

May 28, 2024

Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits

Feb 08, 2024

Beyond Average Return in Markov Decision Processes

Oct 31, 2023

POMRL: No-Regret Learning-to-Plan with Increasing Horizons

Dec 30, 2022

Non-stationary Bandits and Meta-Learning with a Small Set of Optimal Arms

Mar 13, 2022

EigenGame Unloaded: When playing games is better than optimizing

Feb 08, 2021

Asymptotically Optimal Information-Directed Sampling

Nov 11, 2020

The Elliptical Potential Lemma Revisited

Oct 20, 2020

EigenGame: PCA as a Nash Equilibrium

Oct 01, 2020

Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting

Jun 18, 2020