Picture for Debidatta Dwibedi

Debidatta Dwibedi

Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers

Add code
Mar 19, 2024
Figure 1 for Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Figure 2 for Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Figure 3 for Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Figure 4 for Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers
Viaarxiv icon

FlexCap: Generating Rich, Localized, and Flexible Captions in Images

Add code
Mar 18, 2024
Figure 1 for FlexCap: Generating Rich, Localized, and Flexible Captions in Images
Figure 2 for FlexCap: Generating Rich, Localized, and Flexible Captions in Images
Figure 3 for FlexCap: Generating Rich, Localized, and Flexible Captions in Images
Figure 4 for FlexCap: Generating Rich, Localized, and Flexible Captions in Images
Viaarxiv icon

RT-H: Action Hierarchies Using Language

Add code
Mar 04, 2024
Figure 1 for RT-H: Action Hierarchies Using Language
Figure 2 for RT-H: Action Hierarchies Using Language
Figure 3 for RT-H: Action Hierarchies Using Language
Figure 4 for RT-H: Action Hierarchies Using Language
Viaarxiv icon

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents

Add code
Jan 23, 2024
Viaarxiv icon

RoboVQA: Multimodal Long-Horizon Reasoning for Robotics

Add code
Nov 01, 2023
Figure 1 for RoboVQA: Multimodal Long-Horizon Reasoning for Robotics
Figure 2 for RoboVQA: Multimodal Long-Horizon Reasoning for Robotics
Figure 3 for RoboVQA: Multimodal Long-Horizon Reasoning for Robotics
Figure 4 for RoboVQA: Multimodal Long-Horizon Reasoning for Robotics
Viaarxiv icon

Q-Match: Self-Supervised Learning by Matching Distributions Induced by a Queue

Add code
Feb 22, 2023
Figure 1 for Q-Match: Self-Supervised Learning by Matching Distributions Induced by a Queue
Figure 2 for Q-Match: Self-Supervised Learning by Matching Distributions Induced by a Queue
Figure 3 for Q-Match: Self-Supervised Learning by Matching Distributions Induced by a Queue
Figure 4 for Q-Match: Self-Supervised Learning by Matching Distributions Induced by a Queue
Viaarxiv icon

Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations

Add code
May 12, 2022
Figure 1 for Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
Figure 2 for Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
Figure 3 for Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
Figure 4 for Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations
Viaarxiv icon

XIRL: Cross-embodiment Inverse Reinforcement Learning

Add code
Jun 07, 2021
Figure 1 for XIRL: Cross-embodiment Inverse Reinforcement Learning
Figure 2 for XIRL: Cross-embodiment Inverse Reinforcement Learning
Figure 3 for XIRL: Cross-embodiment Inverse Reinforcement Learning
Figure 4 for XIRL: Cross-embodiment Inverse Reinforcement Learning
Viaarxiv icon

With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations

Add code
Apr 29, 2021
Figure 1 for With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations
Figure 2 for With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations
Figure 3 for With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations
Figure 4 for With a Little Help from My Friends: Nearest-Neighbor Contrastive Learning of Visual Representations
Viaarxiv icon

Counting Out Time: Class Agnostic Video Repetition Counting in the Wild

Add code
Jun 27, 2020
Figure 1 for Counting Out Time: Class Agnostic Video Repetition Counting in the Wild
Figure 2 for Counting Out Time: Class Agnostic Video Repetition Counting in the Wild
Figure 3 for Counting Out Time: Class Agnostic Video Repetition Counting in the Wild
Figure 4 for Counting Out Time: Class Agnostic Video Repetition Counting in the Wild
Viaarxiv icon