Jimmy Ba

Exploring Model-based Planning with Policy Networks

Jun 20, 2019
Via arXiv

Neural Graph Evolution: Towards Efficient Automatic Robot Design

Jun 12, 2019
Via arXiv

Graph Normalizing Flows

May 30, 2019
Via arXiv

Interplay Between Optimization and Generalization of Stochastic Gradient Descent with Covariance Noise

Apr 03, 2019
Via arXiv

DOM-Q-NET: Grounded RL on Structured Language

Feb 19, 2019
Via arXiv

ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning

Feb 12, 2019
Via arXiv

Reversible Recurrent Neural Networks

Oct 25, 2018
Via arXiv

On the Convergence and Robustness of Training GANs with Regularized Optimal Transport

May 22, 2018
Via arXiv

Flipout: Efficient Pseudo-Independent Weight Perturbations on Mini-Batches

Apr 02, 2018
Via arXiv

Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation

Aug 18, 2017
Via arXiv