Zafarali Ahmed

Gemma: Open Models Based on Gemini Research and Technology

Mar 13, 2024
Via arXiv

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023
Via arXiv

Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning

Apr 21, 2022
Via arXiv

Temporally Abstract Partial Models

Aug 06, 2021
Via arXiv

AndroidEnv: A Reinforcement Learning Platform for Android

May 27, 2021
Via arXiv

Training a First-Order Theorem Prover from Synthetic Data

Mar 05, 2021
Via arXiv

What can I do here? A Theory of Affordances in Reinforcement Learning

Jun 26, 2020
Via arXiv

Learning to Prove from Synthetic Theorems

Jun 19, 2020
Via arXiv

Marginalized State Distribution Entropy Regularization in Policy Optimization

Dec 11, 2019
Via arXiv