Vidhi Jain

Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge

Jul 09, 2024
Via arXiv

Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers

Mar 19, 2024
Via arXiv

FlexCap: Generating Rich, Localized, and Flexible Captions in Images

Mar 18, 2024
Via arXiv

Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

Dec 15, 2023
Via arXiv

Open X-Embodiment: Robotic Learning Datasets and RT-X Models

Oct 17, 2023
Via arXiv

MAEA: Multimodal Attribution for Embodied AI

Jul 25, 2023
Via arXiv

HomeRobot: Open-Vocabulary Mobile Manipulation

Jun 20, 2023
Via arXiv

Transformers are Adaptable Task Planners

Jul 06, 2022
Via arXiv

Learning Embeddings that Capture Spatial Semantics for Indoor Navigation

Jul 31, 2021
Via arXiv

Predicting Human Strategies in Simulated Search and Rescue Task

Nov 19, 2020
Via arXiv