Picture for Roberto Martín-Martín

Roberto Martín-Martín

Modeling Dynamic Environments with Scene Graph Memory

Add code
Jun 12, 2023
Viaarxiv icon

ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding

Add code
May 18, 2023
Viaarxiv icon

Causal Policy Gradient for Whole-Body Mobile Manipulation

Add code
May 11, 2023
Viaarxiv icon

Procedure-Aware Pretraining for Instructional Video Understanding

Add code
Mar 31, 2023
Figure 1 for Procedure-Aware Pretraining for Instructional Video Understanding
Figure 2 for Procedure-Aware Pretraining for Instructional Video Understanding
Figure 3 for Procedure-Aware Pretraining for Instructional Video Understanding
Figure 4 for Procedure-Aware Pretraining for Instructional Video Understanding
Viaarxiv icon

Model-Agnostic Hierarchical Attention for 3D Object Detection

Add code
Jan 06, 2023
Figure 1 for Model-Agnostic Hierarchical Attention for 3D Object Detection
Figure 2 for Model-Agnostic Hierarchical Attention for 3D Object Detection
Figure 3 for Model-Agnostic Hierarchical Attention for 3D Object Detection
Figure 4 for Model-Agnostic Hierarchical Attention for 3D Object Detection
Viaarxiv icon

ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding

Add code
Dec 10, 2022
Figure 1 for ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding
Figure 2 for ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding
Figure 3 for ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding
Figure 4 for ULIP: Learning Unified Representation of Language, Image and Point Cloud for 3D Understanding
Viaarxiv icon

Retrospectives on the Embodied AI Workshop

Add code
Oct 17, 2022
Figure 1 for Retrospectives on the Embodied AI Workshop
Figure 2 for Retrospectives on the Embodied AI Workshop
Figure 3 for Retrospectives on the Embodied AI Workshop
Figure 4 for Retrospectives on the Embodied AI Workshop
Viaarxiv icon

MaskViT: Masked Visual Pre-Training for Video Prediction

Add code
Jun 23, 2022
Figure 1 for MaskViT: Masked Visual Pre-Training for Video Prediction
Figure 2 for MaskViT: Masked Visual Pre-Training for Video Prediction
Figure 3 for MaskViT: Masked Visual Pre-Training for Video Prediction
Figure 4 for MaskViT: Masked Visual Pre-Training for Video Prediction
Viaarxiv icon

BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents

Add code
Jun 13, 2022
Figure 1 for BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents
Figure 2 for BEHAVIOR in Habitat 2.0: Simulator-Independent Logical Task Description for Benchmarking Embodied AI Agents
Viaarxiv icon

Error-Aware Imitation Learning from Teleoperation Data for Mobile Manipulation

Add code
Dec 09, 2021
Figure 1 for Error-Aware Imitation Learning from Teleoperation Data for Mobile Manipulation
Figure 2 for Error-Aware Imitation Learning from Teleoperation Data for Mobile Manipulation
Figure 3 for Error-Aware Imitation Learning from Teleoperation Data for Mobile Manipulation
Figure 4 for Error-Aware Imitation Learning from Teleoperation Data for Mobile Manipulation
Viaarxiv icon