Dinesh Manocha

Text Prompting for Multi-Concept Video Customization by Autoregressive Generation

May 22, 2024
Via arXiv

LOC-ZSON: Language-driven Object-Centric Zero-Shot Object Retrieval and Navigation

May 08, 2024
Via arXiv

S-EQA: Tackling Situational Queries in Embodied Question Answering

May 08, 2024
Via arXiv

TK-Planes: Tiered K-Planes with High Dimensional Feature Vectors for Dynamic UAV-based Scenes

May 04, 2024
Via arXiv

"Don't forget to put the milk back!" Dataset for Enabling Embodied Agents to Detect Anomalous Situations

Apr 12, 2024
Via arXiv

AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales

Apr 04, 2024
Via arXiv

PoCo: Point Context Cluster for RGBD Indoor Place Recognition

Apr 03, 2024
Via arXiv

Can LLMs Generate Human-Like Wayfinding Instructions? Towards Platform-Agnostic Embodied Instruction Synthesis

Mar 31, 2024
Via arXiv

CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP

Mar 30, 2024
Via arXiv

Do Vision-Language Models Understand Compound Nouns?

Mar 30, 2024
Via arXiv