Maxime Gazeau

Vision-Language Models as a Source of Rewards

Dec 14, 2023
Kate Baumli, Satinder Baveja, Feryal Behbahani, Harris Chan, Gheorghe Comanici, Sebastian Flennerhag, Maxime Gazeau, Kristian Holsheimer, Dan Horgan, Michael Laskin, Clare Lyle, Hussain Masoom, Kay McKinney, Volodymyr Mnih, Alexander Neitz, Fabio Pardo, Jack Parker-Holder, John Quan, Tim Rocktäschel, Himanshu Sahni, Tom Schaul, Yannick Schroecker, Stephen Spencer, Richie Steigerwald, Luyu Wang, Lei Zhang

In-context Reinforcement Learning with Algorithm Distillation

Oct 25, 2022
Michael Laskin, Luyu Wang, Junhyuk Oh, Emilio Parisotto, Stephen Spencer, Richie Steigerwald, DJ Strouse, Steven Hansen, Angelos Filos, Ethan Brooks, Maxime Gazeau, Himanshu Sahni, Satinder Singh, Volodymyr Mnih

Higher Order Generalization Error for First Order Discretization of Langevin Diffusion

Feb 11, 2021
Mufan Bill Li, Maxime Gazeau

Interplay Between Optimization and Generalization of Stochastic Gradient Descent with Covariance Noise

Apr 03, 2019
Yeming Wen, Kevin Luk, Maxime Gazeau, Guodong Zhang, Harris Chan, Jimmy Ba

A general system of differential equations to model first order adaptive algorithms

Oct 31, 2018
André Belotto da Silva, Maxime Gazeau

Scalable Recommender Systems through Recursive Evidence Chains

Jul 05, 2018
Elias Tragas, Calvin Luo, Maxime Gazeau, Kevin Luk, David Duvenaud

Implicit Manifold Learning on Generative Adversarial Networks

Oct 30, 2017
Kry Yik Chau Lui, Yanshuai Cao, Maxime Gazeau, Kelvin Shuangjian Zhang
