Arun Ahuja

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024

Gemini: A Family of Highly Capable Multimodal Models

Dec 19, 2023

Hierarchical reinforcement learning with natural language subgoals

Sep 20, 2023

Collaborating with language models for embodied reasoning

Feb 01, 2023

Distilling Internet-Scale Vision-Language Models into Embodied Agents

Jan 29, 2023

Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

Nov 21, 2022

Learning to Navigate Wikipedia by Taking Random Walks

Oct 31, 2022

Evaluating Multimodal Interactive Agents

May 26, 2022

Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning

Dec 07, 2021

Imitation by Predicting Observations

Jul 08, 2021