Picture for George Tucker

George Tucker

Gemma: Open Models Based on Gemini Research and Technology

Add code
Mar 13, 2024
Figure 1 for Gemma: Open Models Based on Gemini Research and Technology
Figure 2 for Gemma: Open Models Based on Gemini Research and Technology
Figure 3 for Gemma: Open Models Based on Gemini Research and Technology
Figure 4 for Gemma: Open Models Based on Gemini Research and Technology
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

Guided Evolution with Binary Discriminators for ML Program Search

Add code
Feb 08, 2024
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research

Add code
Oct 12, 2023
Figure 1 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Figure 2 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Figure 3 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Figure 4 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Viaarxiv icon

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios

Add code
Dec 21, 2022
Figure 1 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Figure 2 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Figure 3 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Figure 4 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Viaarxiv icon

Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes

Add code
Nov 28, 2022
Figure 1 for Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Figure 2 for Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Figure 3 for Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Figure 4 for Offline Q-Learning on Diverse Multi-Task Data Both Scales And Generalizes
Viaarxiv icon

Oracle Inequalities for Model Selection in Offline Reinforcement Learning

Add code
Nov 03, 2022
Figure 1 for Oracle Inequalities for Model Selection in Offline Reinforcement Learning
Viaarxiv icon

Model Selection in Batch Policy Optimization

Add code
Dec 23, 2021
Figure 1 for Model Selection in Batch Policy Optimization
Figure 2 for Model Selection in Batch Policy Optimization
Viaarxiv icon

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization

Add code
Dec 09, 2021
Figure 1 for DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Figure 2 for DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Figure 3 for DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Figure 4 for DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Viaarxiv icon