Gabriel Barth-Maron

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

A Generalist Agent

May 19, 2022

Launchpad: A Programming Model for Distributed Machine Learning Research

Jun 07, 2021

Reverb: A Framework For Experience Replay

Feb 09, 2021

Acme: A Research Framework for Distributed Reinforcement Learning

Jun 01, 2020

Making Efficient Use of Demonstrations to Solve Hard Exploration Problems

Sep 03, 2019

One-Shot High-Fidelity Imitation: Training Large-Scale Deep Nets with RL

Oct 11, 2018

Observe and Look Further: Achieving Consistent Performance on Atari

May 29, 2018

Distributed Distributional Deterministic Policy Gradients

Apr 23, 2018

Distributed Prioritized Experience Replay

Mar 02, 2018