George Tucker

Doubly Reparameterized Gradient Estimators for Monte Carlo Objectives

Oct 09, 2018

Smoothed Action Value Functions for Learning Gaussian Policies

Jul 25, 2018

Sample-Efficient Reinforcement Learning with Stochastic Ensemble Value Expansion

Jul 04, 2018

Guided evolutionary strategies: escaping the curse of dimensionality in random search

Jun 28, 2018

The Mirage of Action-Dependent Baselines in Reinforcement Learning

Apr 06, 2018

Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling

Feb 26, 2018

Filtering Variational Objectives

Nov 12, 2017

REBAR: Low-variance, unbiased gradient estimates for discrete latent variable models

Nov 06, 2017

Learning Hard Alignments with Variational Inference

Nov 01, 2017

An online sequence-to-sequence model for noisy speech recognition

Jun 16, 2017