Picture for Pete Florence

Pete Florence

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

Add code
Jul 10, 2024
Viaarxiv icon

SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities

Add code
Jan 22, 2024
Viaarxiv icon

RoboVQA: Multimodal Long-Horizon Reasoning for Robotics

Add code
Nov 01, 2023
Figure 1 for RoboVQA: Multimodal Long-Horizon Reasoning for Robotics
Figure 2 for RoboVQA: Multimodal Long-Horizon Reasoning for Robotics
Figure 3 for RoboVQA: Multimodal Long-Horizon Reasoning for Robotics
Figure 4 for RoboVQA: Multimodal Long-Horizon Reasoning for Robotics
Viaarxiv icon

Video Language Planning

Add code
Oct 16, 2023
Figure 1 for Video Language Planning
Figure 2 for Video Language Planning
Figure 3 for Video Language Planning
Figure 4 for Video Language Planning
Viaarxiv icon

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

Add code
Jul 28, 2023
Figure 1 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Figure 2 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Figure 3 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Figure 4 for RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Viaarxiv icon

Towards Generalist Biomedical AI

Add code
Jul 26, 2023
Figure 1 for Towards Generalist Biomedical AI
Figure 2 for Towards Generalist Biomedical AI
Figure 3 for Towards Generalist Biomedical AI
Figure 4 for Towards Generalist Biomedical AI
Viaarxiv icon

Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition

Add code
Jul 26, 2023
Figure 1 for Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition
Figure 2 for Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition
Figure 3 for Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition
Figure 4 for Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition
Viaarxiv icon

Large Language Models as General Pattern Machines

Add code
Jul 10, 2023
Figure 1 for Large Language Models as General Pattern Machines
Figure 2 for Large Language Models as General Pattern Machines
Figure 3 for Large Language Models as General Pattern Machines
Figure 4 for Large Language Models as General Pattern Machines
Viaarxiv icon

RoboPianist: A Benchmark for High-Dimensional Robot Control

Add code
Apr 09, 2023
Figure 1 for RoboPianist: A Benchmark for High-Dimensional Robot Control
Figure 2 for RoboPianist: A Benchmark for High-Dimensional Robot Control
Figure 3 for RoboPianist: A Benchmark for High-Dimensional Robot Control
Figure 4 for RoboPianist: A Benchmark for High-Dimensional Robot Control
Viaarxiv icon

PaLM-E: An Embodied Multimodal Language Model

Add code
Mar 06, 2023
Figure 1 for PaLM-E: An Embodied Multimodal Language Model
Figure 2 for PaLM-E: An Embodied Multimodal Language Model
Figure 3 for PaLM-E: An Embodied Multimodal Language Model
Figure 4 for PaLM-E: An Embodied Multimodal Language Model
Viaarxiv icon