Picture for Michael Teng

Michael Teng

Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning

Feb 06, 2022
Figure 1 for Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning
Figure 2 for Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning
Figure 3 for Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning
Figure 4 for Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning
Viaarxiv icon

Semi-supervised Sequential Generative Models

Jun 30, 2020
Figure 1 for Semi-supervised Sequential Generative Models
Figure 2 for Semi-supervised Sequential Generative Models
Figure 3 for Semi-supervised Sequential Generative Models
Figure 4 for Semi-supervised Sequential Generative Models
Viaarxiv icon

Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training

Add code
Jun 13, 2019
Figure 1 for Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training
Figure 2 for Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training
Figure 3 for Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training
Figure 4 for Near-Optimal Glimpse Sequences for Improved Hard Attention Neural Network Training
Viaarxiv icon

Imitation Learning of Factored Multi-agent Reactive Models

Mar 12, 2019
Figure 1 for Imitation Learning of Factored Multi-agent Reactive Models
Figure 2 for Imitation Learning of Factored Multi-agent Reactive Models
Figure 3 for Imitation Learning of Factored Multi-agent Reactive Models
Figure 4 for Imitation Learning of Factored Multi-agent Reactive Models
Viaarxiv icon

High Throughput Synchronous Distributed Stochastic Gradient Descent

Mar 12, 2018
Figure 1 for High Throughput Synchronous Distributed Stochastic Gradient Descent
Figure 2 for High Throughput Synchronous Distributed Stochastic Gradient Descent
Figure 3 for High Throughput Synchronous Distributed Stochastic Gradient Descent
Figure 4 for High Throughput Synchronous Distributed Stochastic Gradient Descent
Viaarxiv icon