Picture for Arthur Guez

Arthur Guez

A Unifying Framework for Action-Conditional Self-Predictive Reinforcement Learning

Add code
Jun 04, 2024
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Optimism and Adaptivity in Policy Optimization

Add code
Jun 18, 2023
Figure 1 for Optimism and Adaptivity in Policy Optimization
Figure 2 for Optimism and Adaptivity in Policy Optimization
Figure 3 for Optimism and Adaptivity in Policy Optimization
Figure 4 for Optimism and Adaptivity in Policy Optimization
Viaarxiv icon

Large-Scale Retrieval for Reinforcement Learning

Add code
Jun 10, 2022
Figure 1 for Large-Scale Retrieval for Reinforcement Learning
Figure 2 for Large-Scale Retrieval for Reinforcement Learning
Figure 3 for Large-Scale Retrieval for Reinforcement Learning
Figure 4 for Large-Scale Retrieval for Reinforcement Learning
Viaarxiv icon

COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation

Add code
Apr 19, 2022
Figure 1 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Figure 2 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Figure 3 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Figure 4 for COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation
Viaarxiv icon

Retrieval-Augmented Reinforcement Learning

Add code
Mar 09, 2022
Figure 1 for Retrieval-Augmented Reinforcement Learning
Figure 2 for Retrieval-Augmented Reinforcement Learning
Figure 3 for Retrieval-Augmented Reinforcement Learning
Figure 4 for Retrieval-Augmented Reinforcement Learning
Viaarxiv icon

Muesli: Combining Improvements in Policy Optimization

Add code
Apr 13, 2021
Figure 1 for Muesli: Combining Improvements in Policy Optimization
Figure 2 for Muesli: Combining Improvements in Policy Optimization
Figure 3 for Muesli: Combining Improvements in Policy Optimization
Figure 4 for Muesli: Combining Improvements in Policy Optimization
Viaarxiv icon

Counterfactual Credit Assignment in Model-Free Reinforcement Learning

Add code
Nov 18, 2020
Figure 1 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Figure 2 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Figure 3 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Figure 4 for Counterfactual Credit Assignment in Model-Free Reinforcement Learning
Viaarxiv icon

On the role of planning in model-based deep reinforcement learning

Add code
Nov 08, 2020
Figure 1 for On the role of planning in model-based deep reinforcement learning
Figure 2 for On the role of planning in model-based deep reinforcement learning
Figure 3 for On the role of planning in model-based deep reinforcement learning
Figure 4 for On the role of planning in model-based deep reinforcement learning
Viaarxiv icon