Picture for Roozbeh Mottaghi

Roozbeh Mottaghi

Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks

Add code
Jun 17, 2022
Figure 1 for Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Figure 2 for Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Figure 3 for Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Figure 4 for Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks
Viaarxiv icon

What do navigation agents learn about their environment?

Add code
Jun 17, 2022
Figure 1 for What do navigation agents learn about their environment?
Figure 2 for What do navigation agents learn about their environment?
Figure 3 for What do navigation agents learn about their environment?
Figure 4 for What do navigation agents learn about their environment?
Viaarxiv icon

ProcTHOR: Large-Scale Embodied AI Using Procedural Generation

Add code
Jun 14, 2022
Figure 1 for ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Figure 2 for ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Figure 3 for ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Figure 4 for ProcTHOR: Large-Scale Embodied AI Using Procedural Generation
Viaarxiv icon

A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge

Add code
Jun 03, 2022
Figure 1 for A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge
Figure 2 for A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge
Figure 3 for A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge
Figure 4 for A-OKVQA: A Benchmark for Visual Question Answering using World Knowledge
Viaarxiv icon

Continuous Scene Representations for Embodied AI

Add code
Mar 31, 2022
Figure 1 for Continuous Scene Representations for Embodied AI
Figure 2 for Continuous Scene Representations for Embodied AI
Figure 3 for Continuous Scene Representations for Embodied AI
Figure 4 for Continuous Scene Representations for Embodied AI
Viaarxiv icon

Object Manipulation via Visual Target Localization

Add code
Mar 15, 2022
Figure 1 for Object Manipulation via Visual Target Localization
Figure 2 for Object Manipulation via Visual Target Localization
Figure 3 for Object Manipulation via Visual Target Localization
Figure 4 for Object Manipulation via Visual Target Localization
Viaarxiv icon

ASC me to Do Anything: Multi-task Training for Embodied AI

Add code
Feb 14, 2022
Figure 1 for ASC me to Do Anything: Multi-task Training for Embodied AI
Figure 2 for ASC me to Do Anything: Multi-task Training for Embodied AI
Figure 3 for ASC me to Do Anything: Multi-task Training for Embodied AI
Figure 4 for ASC me to Do Anything: Multi-task Training for Embodied AI
Viaarxiv icon

Interactron: Embodied Adaptive Object Detection

Add code
Feb 01, 2022
Figure 1 for Interactron: Embodied Adaptive Object Detection
Figure 2 for Interactron: Embodied Adaptive Object Detection
Figure 3 for Interactron: Embodied Adaptive Object Detection
Figure 4 for Interactron: Embodied Adaptive Object Detection
Viaarxiv icon

Simple but Effective: CLIP Embeddings for Embodied AI

Add code
Nov 18, 2021
Figure 1 for Simple but Effective: CLIP Embeddings for Embodied AI
Figure 2 for Simple but Effective: CLIP Embeddings for Embodied AI
Figure 3 for Simple but Effective: CLIP Embeddings for Embodied AI
Figure 4 for Simple but Effective: CLIP Embeddings for Embodied AI
Viaarxiv icon

CORA: Benchmarks, Baselines, and Metrics as a Platform for Continual Reinforcement Learning Agents

Add code
Oct 19, 2021
Viaarxiv icon