Picture for Nando de Freitas

Nando de Freitas

University of British Columbia

Acme: A Research Framework for Distributed Reinforcement Learning

Add code
Jun 01, 2020
Figure 1 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 2 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 3 for Acme: A Research Framework for Distributed Reinforcement Learning
Figure 4 for Acme: A Research Framework for Distributed Reinforcement Learning
Viaarxiv icon

Task-Relevant Adversarial Imitation Learning

Add code
Oct 02, 2019
Figure 1 for Task-Relevant Adversarial Imitation Learning
Figure 2 for Task-Relevant Adversarial Imitation Learning
Figure 3 for Task-Relevant Adversarial Imitation Learning
Figure 4 for Task-Relevant Adversarial Imitation Learning
Viaarxiv icon

A Framework for Data-Driven Robotics

Add code
Sep 26, 2019
Figure 1 for A Framework for Data-Driven Robotics
Figure 2 for A Framework for Data-Driven Robotics
Figure 3 for A Framework for Data-Driven Robotics
Figure 4 for A Framework for Data-Driven Robotics
Viaarxiv icon

Modular Meta-Learning with Shrinkage

Add code
Sep 12, 2019
Figure 1 for Modular Meta-Learning with Shrinkage
Figure 2 for Modular Meta-Learning with Shrinkage
Figure 3 for Modular Meta-Learning with Shrinkage
Figure 4 for Modular Meta-Learning with Shrinkage
Viaarxiv icon

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems

Add code
Sep 03, 2019
Figure 1 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Figure 2 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Figure 3 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Figure 4 for Making Efficient Use of Demonstrations to Solve Hard Exploration Problems
Viaarxiv icon

Learning Compositional Neural Programs with Recursive Tree Search and Planning

Add code
May 30, 2019
Figure 1 for Learning Compositional Neural Programs with Recursive Tree Search and Planning
Figure 2 for Learning Compositional Neural Programs with Recursive Tree Search and Planning
Figure 3 for Learning Compositional Neural Programs with Recursive Tree Search and Planning
Figure 4 for Learning Compositional Neural Programs with Recursive Tree Search and Planning
Viaarxiv icon

Meta-learning of Sequential Strategies

Add code
May 08, 2019
Figure 1 for Meta-learning of Sequential Strategies
Figure 2 for Meta-learning of Sequential Strategies
Figure 3 for Meta-learning of Sequential Strategies
Figure 4 for Meta-learning of Sequential Strategies
Viaarxiv icon

Bayesian Optimization in AlphaGo

Add code
Dec 17, 2018
Figure 1 for Bayesian Optimization in AlphaGo
Figure 2 for Bayesian Optimization in AlphaGo
Figure 3 for Bayesian Optimization in AlphaGo
Figure 4 for Bayesian Optimization in AlphaGo
Viaarxiv icon

Intrinsic Social Motivation via Causal Influence in Multi-Agent RL

Add code
Oct 19, 2018
Figure 1 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Figure 2 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Figure 3 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Figure 4 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Viaarxiv icon

One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL

Add code
Oct 11, 2018
Figure 1 for One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
Figure 2 for One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
Figure 3 for One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
Figure 4 for One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL
Viaarxiv icon