
Michal Valko

Model-Free Learning for Two-Player Zero-Sum Partially Observable Markov Games with Perfect Recall

Jun 11, 2021
Tadashi Kozuno, Pierre Ménard, Rémi Munos, Michal Valko

Via arXiv

Taylor Expansion of Discount Factors

Jun 11, 2021
Yunhao Tang, Mark Rowland, Rémi Munos, Michal Valko

Via arXiv

Stochastic Shortest Path: Minimax, Parameter-Free and Towards Horizon-Free Regret

Apr 22, 2021
Jean Tarbouriech, Runlong Zhou, Simon S. Du, Matteo Pirotta, Michal Valko, Alessandro Lazaric

Via arXiv

Broaden Your Views for Self-Supervised Video Learning

Mar 30, 2021
Adrià Recasens, Pauline Luc, Jean-Baptiste Alayrac, Luyu Wang, Florian Strub, Corentin Tallec, Mateusz Malinowski, Viorica Patraucean, Florent Altché, Michal Valko, Jean-Bastien Grill, Aäron van den Oord, Andrew Zisserman

Via arXiv

UCB Momentum Q-learning: Correcting the bias without forgetting

Mar 01, 2021
Pierre Ménard, Omar Darwiche Domingues, Xuedong Shang, Michal Valko

Via arXiv

Revisiting Peng's Q($λ$) for Modern Reinforcement Learning

Feb 27, 2021
Tadashi Kozuno, Yunhao Tang, Mark Rowland, Rémi Munos, Steven Kapturowski, Will Dabney, Michal Valko, David Abel

Via arXiv

Mine Your Own vieW: Self-Supervised Learning Through Across-Sample Prediction

Feb 19, 2021
Mehdi Azabou, Mohammad Gheshlaghi Azar, Ran Liu, Chi-Heng Lin, Erik C. Johnson, Kiran Bhaskaran-Nair, Max Dabagia, Keith B. Hengen, William Gray-Roncal, Michal Valko, Eva L. Dyer

Via arXiv

Bootstrapped Representation Learning on Graphs

Feb 12, 2021
Shantanu Thakoor, Corentin Tallec, Mohammad Gheshlaghi Azar, Rémi Munos, Petar Veličković, Michal Valko

Via arXiv

Geometric Entropic Exploration

Jan 07, 2021
Zhaohan Daniel Guo, Mohammad Gheshlaghi Azar, Alaa Saade, Shantanu Thakoor, Bilal Piot, Bernardo Avila Pires, Michal Valko, Thomas Mesnard, Tor Lattimore, Rémi Munos

Via arXiv