Alert button
Picture for Matt Hoffman

Matt Hoffman

Alert button

An Empirical Study of Implicit Regularization in Deep Offline RL

Add code
Bookmark button
Alert button
Jul 07, 2022
Caglar Gulcehre, Srivatsan Srinivasan, Jakub Sygnowski, Georg Ostrovski, Mehrdad Farajtabar, Matt Hoffman, Razvan Pascanu, Arnaud Doucet

Figure 1 for An Empirical Study of Implicit Regularization in Deep Offline RL
Figure 2 for An Empirical Study of Implicit Regularization in Deep Offline RL
Figure 3 for An Empirical Study of Implicit Regularization in Deep Offline RL
Figure 4 for An Empirical Study of Implicit Regularization in Deep Offline RL
Viaarxiv icon

Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach

Add code
Bookmark button
Alert button
Apr 22, 2022
Bobak Shahriari, Abbas Abdolmaleki, Arunkumar Byravan, Abe Friesen, Siqi Liu, Jost Tobias Springenberg, Nicolas Heess, Matt Hoffman, Martin Riedmiller

Figure 1 for Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Figure 2 for Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Figure 3 for Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Figure 4 for Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach
Viaarxiv icon

Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach

Add code
Bookmark button
Alert button
Apr 21, 2022
Bobak Shahriari, Abbas Abdolmaleki, Arunkumar Byravan, Abe Friesen, Siqi Liu, Jost Tobias Springenberg, Nicolas Heess, Matt Hoffman, Martin Riedmiller

Figure 1 for Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach
Figure 2 for Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach
Figure 3 for Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach
Figure 4 for Revisiting Gaussian mixture critic in off-policy reinforcement learning: a sample-based approach
Viaarxiv icon

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 02, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

Figure 1 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 2 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 3 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Figure 4 for RL Unplugged: Benchmarks for Offline Reinforcement Learning
Viaarxiv icon

Acme: A Research Framework for Distributed Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 01, 2020
Matt Hoffman, Bobak Shahriari, John Aslanides, Gabriel Barth-Maron, Feryal Behbahani, Tamara Norman, Abbas Abdolmaleki, Albin Cassirer, Fan Yang, Kate Baumli, Sarah Henderson, Alex Novikov, Sergio Gómez Colmenarejo, Serkan Cabi, Caglar Gulcehre, Tom Le Paine, Andrew Cowie, Ziyu Wang, Bilal Piot, Nando de Freitas

Figure 1 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 2 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 3 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 4 for Acme: A Research Framework for Distributed Reinforcement Learning
Viaarxiv icon

Improving the Gating Mechanism of Recurrent Neural Networks

Add code
Bookmark button
Alert button
Oct 22, 2019
Albert Gu, Caglar Gulcehre, Tom Le Paine, Matt Hoffman, Razvan Pascanu

Figure 1 for Improving the Gating Mechanism of Recurrent Neural Networks
Figure 2 for Improving the Gating Mechanism of Recurrent Neural Networks
Figure 3 for Improving the Gating Mechanism of Recurrent Neural Networks
Figure 4 for Improving the Gating Mechanism of Recurrent Neural Networks
Viaarxiv icon

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems

Add code
Bookmark button
Alert button
Sep 03, 2019
Tom Le Paine, Caglar Gulcehre, Bobak Shahriari, Misha Denil, Matt Hoffman, Hubert Soyer, Richard Tanburn, Steven Kapturowski, Neil Rabinowitz, Duncan Williams, Gabriel Barth-Maron, Ziyu Wang, Nando de Freitas, Worlds Team

Figure 1 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Figure 2 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Figure 3 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Figure 4 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Viaarxiv icon

TensorFlow Distributions

Add code
Bookmark button
Alert button
Nov 28, 2017
Joshua V. Dillon, Ian Langmore, Dustin Tran, Eugene Brevdo, Srinivas Vasudevan, Dave Moore, Brian Patton, Alex Alemi, Matt Hoffman, Rif A. Saurous

Figure 1 for TensorFlow Distributions
Figure 2 for TensorFlow Distributions
Figure 3 for TensorFlow Distributions
Viaarxiv icon

Celeste: Variational inference for a generative model of astronomical images

Add code
Bookmark button
Alert button
Jun 03, 2015
Jeffrey Regier, Andrew Miller, Jon McAuliffe, Ryan Adams, Matt Hoffman, Dustin Lang, David Schlegel, Prabhat

Figure 1 for Celeste: Variational inference for a generative model of astronomical images
Figure 2 for Celeste: Variational inference for a generative model of astronomical images
Figure 3 for Celeste: Variational inference for a generative model of astronomical images
Figure 4 for Celeste: Variational inference for a generative model of astronomical images
Viaarxiv icon