Picture for Swetha Rajkumar

Swetha Rajkumar

KALIE: Fine-Tuning Vision-Language Models for Open-World Manipulation without Robot Data

Add code
Sep 21, 2024
Figure 1 for KALIE: Fine-Tuning Vision-Language Models for Open-World Manipulation without Robot Data
Figure 2 for KALIE: Fine-Tuning Vision-Language Models for Open-World Manipulation without Robot Data
Figure 3 for KALIE: Fine-Tuning Vision-Language Models for Open-World Manipulation without Robot Data
Figure 4 for KALIE: Fine-Tuning Vision-Language Models for Open-World Manipulation without Robot Data
Viaarxiv icon