Alert button
Picture for George Tucker

George Tucker

Alert button

Learning to Walk via Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 25, 2019
Tuomas Haarnoja, Sehoon Ha, Aurick Zhou, Jie Tan, George Tucker, Sergey Levine

Figure 1 for Learning to Walk via Deep Reinforcement Learning
Figure 2 for Learning to Walk via Deep Reinforcement Learning
Figure 3 for Learning to Walk via Deep Reinforcement Learning
Figure 4 for Learning to Walk via Deep Reinforcement Learning
Viaarxiv icon

Model-Based Reinforcement Learning for Atari

Add code
Bookmark button
Alert button
Mar 05, 2019
Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H Campbell, Konrad Czechowski, Dumitru Erhan, Chelsea Finn, Piotr Kozakowski, Sergey Levine, Ryan Sepassi, George Tucker, Henryk Michalewski

Figure 1 for Model-Based Reinforcement Learning for Atari
Figure 2 for Model-Based Reinforcement Learning for Atari
Figure 3 for Model-Based Reinforcement Learning for Atari
Figure 4 for Model-Based Reinforcement Learning for Atari
Viaarxiv icon

Soft Actor-Critic Algorithms and Applications

Add code
Bookmark button
Alert button
Jan 29, 2019
Tuomas Haarnoja, Aurick Zhou, Kristian Hartikainen, George Tucker, Sehoon Ha, Jie Tan, Vikash Kumar, Henry Zhu, Abhishek Gupta, Pieter Abbeel, Sergey Levine

Figure 1 for Soft Actor-Critic Algorithms and Applications
Figure 2 for Soft Actor-Critic Algorithms and Applications
Figure 3 for Soft Actor-Critic Algorithms and Applications
Figure 4 for Soft Actor-Critic Algorithms and Applications
Viaarxiv icon

The Laplacian in RL: Learning Representations with Efficient Approximations

Add code
Bookmark button
Alert button
Oct 10, 2018
Yifan Wu, George Tucker, Ofir Nachum

Figure 1 for The Laplacian in RL: Learning Representations with Efficient Approximations
Figure 2 for The Laplacian in RL: Learning Representations with Efficient Approximations
Figure 3 for The Laplacian in RL: Learning Representations with Efficient Approximations
Figure 4 for The Laplacian in RL: Learning Representations with Efficient Approximations
Viaarxiv icon

Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives

Add code
Bookmark button
Alert button
Oct 09, 2018
George Tucker, Dieterich Lawson, Shixiang Gu, Chris J. Maddison

Figure 1 for Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives
Figure 2 for Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives
Figure 3 for Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives
Figure 4 for Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives
Viaarxiv icon

Smoothed Action Value Functions for Learning Gaussian Policies

Add code
Bookmark button
Alert button
Jul 25, 2018
Ofir Nachum, Mohammad Norouzi, George Tucker, Dale Schuurmans

Figure 1 for Smoothed Action Value Functions for Learning Gaussian Policies
Viaarxiv icon

Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion

Add code
Bookmark button
Alert button
Jul 04, 2018
Jacob Buckman, Danijar Hafner, George Tucker, Eugene Brevdo, Honglak Lee

Figure 1 for Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Figure 2 for Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Figure 3 for Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Figure 4 for Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion
Viaarxiv icon

Guided evolutionary strategies: escaping the curse of dimensionality in random search

Add code
Bookmark button
Alert button
Jun 28, 2018
Niru Maheswaranathan, Luke Metz, George Tucker, Jascha Sohl-Dickstein

Figure 1 for Guided evolutionary strategies: escaping the curse of dimensionality in random search
Figure 2 for Guided evolutionary strategies: escaping the curse of dimensionality in random search
Figure 3 for Guided evolutionary strategies: escaping the curse of dimensionality in random search
Figure 4 for Guided evolutionary strategies: escaping the curse of dimensionality in random search
Viaarxiv icon