Ayzaan Wahid

Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers

Mar 19, 2024
Via arXiv

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Feb 18, 2024
Via arXiv

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Feb 12, 2024
Via arXiv

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Oct 17, 2023
Via arXiv

Video Language Planning

Oct 16, 2023
Via arXiv

RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control

Jul 28, 2023
Via arXiv

PaLM-E: An Embodied Multimodal Language Model

Mar 06, 2023
Via arXiv

Robotic Skill Acquisition via Instruction Augmentation with Vision-Language Models

Nov 22, 2022
Via arXiv

Interactive Language: Talking to Robots in Real Time

Oct 12, 2022
Via arXiv

Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations

May 12, 2022
Via arXiv