Alert button
Picture for George Tucker

George Tucker

Alert button

The Mirage of Action-Dependent Baselines in Reinforcement Learning

Apr 06, 2018
George Tucker, Surya Bhupatiraju, Shixiang Gu, Richard E. Turner, Zoubin Ghahramani, Sergey Levine

Figure 1 for The Mirage of Action-Dependent Baselines in Reinforcement Learning
Figure 2 for The Mirage of Action-Dependent Baselines in Reinforcement Learning
Figure 3 for The Mirage of Action-Dependent Baselines in Reinforcement Learning
Figure 4 for The Mirage of Action-Dependent Baselines in Reinforcement Learning
Viaarxiv icon

Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling

Feb 26, 2018
Carlos Riquelme, George Tucker, Jasper Snoek

Figure 1 for Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
Figure 2 for Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
Figure 3 for Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
Figure 4 for Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
Viaarxiv icon

Filtering Variational Objectives

Nov 12, 2017
Chris J. Maddison, Dieterich Lawson, George Tucker, Nicolas Heess, Mohammad Norouzi, Andriy Mnih, Arnaud Doucet, Yee Whye Teh

Figure 1 for Filtering Variational Objectives
Figure 2 for Filtering Variational Objectives
Figure 3 for Filtering Variational Objectives
Figure 4 for Filtering Variational Objectives
Viaarxiv icon

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

Nov 06, 2017
George Tucker, Andriy Mnih, Chris J. Maddison, Dieterich Lawson, Jascha Sohl-Dickstein

Figure 1 for REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models
Figure 2 for REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models
Figure 3 for REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models
Viaarxiv icon

Learning Hard Alignments with Variational Inference

Nov 01, 2017
Dieterich Lawson, Chung-Cheng Chiu, George Tucker, Colin Raffel, Kevin Swersky, Navdeep Jaitly

Figure 1 for Learning Hard Alignments with Variational Inference
Figure 2 for Learning Hard Alignments with Variational Inference
Figure 3 for Learning Hard Alignments with Variational Inference
Viaarxiv icon

An online sequence-to-sequence model for noisy speech recognition

Jun 16, 2017
Chung-Cheng Chiu, Dieterich Lawson, Yuping Luo, George Tucker, Kevin Swersky, Ilya Sutskever, Navdeep Jaitly

Figure 1 for An online sequence-to-sequence model for noisy speech recognition
Figure 2 for An online sequence-to-sequence model for noisy speech recognition
Figure 3 for An online sequence-to-sequence model for noisy speech recognition
Figure 4 for An online sequence-to-sequence model for noisy speech recognition
Viaarxiv icon

Compacting Neural Network Classifiers via Dropout Training

May 24, 2017
Yotaro Kubo, George Tucker, Simon Wiesler

Figure 1 for Compacting Neural Network Classifiers via Dropout Training
Viaarxiv icon

Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting

May 05, 2017
Ming Sun, Anirudh Raju, George Tucker, Sankaran Panchapagesan, Gengshen Fu, Arindam Mandal, Spyros Matsoukas, Nikko Strom, Shiv Vitaladevuni

Figure 1 for Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting
Figure 2 for Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting
Figure 3 for Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting
Figure 4 for Max-Pooling Loss Training of Long Short-Term Memory Networks for Small-Footprint Keyword Spotting
Viaarxiv icon

Particle Value Functions

Mar 16, 2017
Chris J. Maddison, Dieterich Lawson, George Tucker, Nicolas Heess, Arnaud Doucet, Andriy Mnih, Yee Whye Teh

Figure 1 for Particle Value Functions
Figure 2 for Particle Value Functions
Figure 3 for Particle Value Functions
Figure 4 for Particle Value Functions
Viaarxiv icon

Regularizing Neural Networks by Penalizing Confident Output Distributions

Jan 23, 2017
Gabriel Pereyra, George Tucker, Jan Chorowski, Łukasz Kaiser, Geoffrey Hinton

Figure 1 for Regularizing Neural Networks by Penalizing Confident Output Distributions
Figure 2 for Regularizing Neural Networks by Penalizing Confident Output Distributions
Figure 3 for Regularizing Neural Networks by Penalizing Confident Output Distributions
Figure 4 for Regularizing Neural Networks by Penalizing Confident Output Distributions
Viaarxiv icon