Alert button
Picture for Michal Valko

Michal Valko

Alert button

Stochastic bandits with arm-dependent delays

Add code
Bookmark button
Alert button
Jun 18, 2020
Anne Gael Manegueu, Claire Vernade, Alexandra Carpentier, Michal Valko

Figure 1 for Stochastic bandits with arm-dependent delays
Figure 2 for Stochastic bandits with arm-dependent delays
Figure 3 for Stochastic bandits with arm-dependent delays
Figure 4 for Stochastic bandits with arm-dependent delays
Viaarxiv icon

Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning

Add code
Bookmark button
Alert button
Jun 13, 2020
Jean-Bastien Grill, Florian Strub, Florent Altché, Corentin Tallec, Pierre H. Richemond, Elena Buchatskaya, Carl Doersch, Bernardo Avila Pires, Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Bilal Piot, Koray Kavukcuoglu, Rémi Munos, Michal Valko

Figure 1 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 2 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 3 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Figure 4 for Bootstrap Your Own Latent: A New Approach to Self-Supervised Learning
Viaarxiv icon

Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits

Add code
Bookmark button
Alert button
Jun 11, 2020
Pierre Perrault, Etienne Boursier, Vianney Perchet, Michal Valko

Figure 1 for Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits
Figure 2 for Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits
Figure 3 for Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits
Figure 4 for Statistical Efficiency of Thompson Sampling for Combinatorial Semi-Bandits
Viaarxiv icon

Adaptive Reward-Free Exploration

Add code
Bookmark button
Alert button
Jun 11, 2020
Emilie Kaufmann, Pierre Ménard, Omar Darwiche Domingues, Anders Jonsson, Edouard Leurent, Michal Valko

Figure 1 for Adaptive Reward-Free Exploration
Figure 2 for Adaptive Reward-Free Exploration
Figure 3 for Adaptive Reward-Free Exploration
Viaarxiv icon

Planning in Markov Decision Processes with Gap-Dependent Sample Complexity

Add code
Bookmark button
Alert button
Jun 10, 2020
Anders Jonsson, Emilie Kaufmann, Pierre Ménard, Omar Darwiche Domingues, Edouard Leurent, Michal Valko

Figure 1 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Figure 2 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Figure 3 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Figure 4 for Planning in Markov Decision Processes with Gap-Dependent Sample Complexity
Viaarxiv icon

Regret Bounds for Kernel-Based Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 12, 2020
Omar Darwiche Domingues, Pierre Ménard, Matteo Pirotta, Emilie Kaufmann, Michal Valko

Figure 1 for Regret Bounds for Kernel-Based Reinforcement Learning
Figure 2 for Regret Bounds for Kernel-Based Reinforcement Learning
Figure 3 for Regret Bounds for Kernel-Based Reinforcement Learning
Viaarxiv icon

Taylor Expansion Policy Optimization

Add code
Bookmark button
Alert button
Mar 13, 2020
Yunhao Tang, Michal Valko, Rémi Munos

Figure 1 for Taylor Expansion Policy Optimization
Figure 2 for Taylor Expansion Policy Optimization
Figure 3 for Taylor Expansion Policy Optimization
Figure 4 for Taylor Expansion Policy Optimization
Viaarxiv icon

Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification

Add code
Bookmark button
Alert button
Feb 26, 2020
Daniele Calandriello, Luigi Carratino, Alessandro Lazaric, Michal Valko, Lorenzo Rosasco

Figure 1 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Figure 2 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Figure 3 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Figure 4 for Near-linear Time Gaussian Process Optimization with Adaptive Batching and Resparsification
Viaarxiv icon

No-Regret Exploration in Goal-Oriented Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 30, 2020
Jean Tarbouriech, Evrard Garcelon, Michal Valko, Matteo Pirotta, Alessandro Lazaric

Figure 1 for No-Regret Exploration in Goal-Oriented Reinforcement Learning
Figure 2 for No-Regret Exploration in Goal-Oriented Reinforcement Learning
Figure 3 for No-Regret Exploration in Goal-Oriented Reinforcement Learning
Figure 4 for No-Regret Exploration in Goal-Oriented Reinforcement Learning
Viaarxiv icon