Dan Horgan

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

Vision-Language Models as a Source of Rewards

Dec 14, 2023
Via arXiv

Unicorn: Continual Learning with a Universal, Off-policy Agent

Jul 03, 2018
Via arXiv

Observe and Look Further: Achieving Consistent Performance on Atari

May 29, 2018
Via arXiv

Distributed Distributional Deterministic Policy Gradients

Apr 23, 2018
Via arXiv

Distributed Prioritized Experience Replay

Mar 02, 2018
Via arXiv

Deep Q-learning from Demonstrations

Nov 22, 2017
Via arXiv

Rainbow: Combining Improvements in Deep Reinforcement Learning

Oct 06, 2017
Via arXiv