Fei Xia

Google DeepMind

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Feb 18, 2024
Via arXiv

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Feb 12, 2024
Via arXiv

Generative Expressive Robot Behaviors using Large Language Models

Jan 30, 2024
Via arXiv

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents

Jan 23, 2024
Via arXiv

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities

Jan 22, 2024
Via arXiv

Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

Dec 15, 2023
Via arXiv

Chain of Code: Reasoning with a Language Model-Augmented Code Emulator

Dec 08, 2023
Via arXiv

Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections

Nov 17, 2023
Via arXiv

RoboVQA: Multimodal Long-Horizon Reasoning for Robotics

Nov 01, 2023
Via arXiv