George Tucker

Coupled Gradient Estimators for Discrete Latent Variables

Jun 15, 2021
Zhe Dong, Andriy Mnih, George Tucker

Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization

Apr 28, 2021
Michael R. Zhang, Tom Le Paine, Ofir Nachum, Cosmin Paduraru, George Tucker, Ziyu Wang, Mohammad Norouzi

Benchmarks for Deep Off-Policy Evaluation

Mar 30, 2021
Justin Fu, Mohammad Norouzi, Ofir Nachum, George Tucker, Ziyu Wang, Alexander Novikov, Mengjiao Yang, Michael R. Zhang, Yutian Chen, Aviral Kumar, Cosmin Paduraru, Sergey Levine, Tom Le Paine

Offline Policy Selection under Uncertainty

Dec 12, 2020
Mengjiao Yang, Bo Dai, Ofir Nachum, George Tucker, Dale Schuurmans

RL Unplugged: Benchmarks for Offline Reinforcement Learning

Jul 02, 2020
Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Tom Le Paine, Sergio Gomez Colmenarejo, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matt Hoffman, Ofir Nachum, George Tucker, Nicolas Heess, Nando de Freitas

Conservative Q-Learning for Offline Reinforcement Learning

Jun 29, 2020
Aviral Kumar, Aurick Zhou, George Tucker, Sergey Levine

DisARM: An Antithetic Gradient Estimator for Binary Latent Variables

Jun 18, 2020
Zhe Dong, Andriy Mnih, George Tucker

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

May 04, 2020
Sergey Levine, Aviral Kumar, George Tucker, Justin Fu
