Picture for Vineet Gandhi

Vineet Gandhi

CVIT, IIIT Hyderabad

VELOCITI: Can Video-Language Models Bind Semantic Concepts through Time?

Add code
Jun 16, 2024
Viaarxiv icon

SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning

Add code
Feb 07, 2024
Figure 1 for SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning
Figure 2 for SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning
Figure 3 for SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning
Figure 4 for SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning
Viaarxiv icon

Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings

Add code
Nov 27, 2023
Figure 1 for Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Figure 2 for Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Figure 3 for Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Figure 4 for Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings
Viaarxiv icon

RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations

Add code
Jul 03, 2023
Figure 1 for RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations
Figure 2 for RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations
Figure 3 for RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations
Figure 4 for RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations
Viaarxiv icon

Instance-Level Semantic Maps for Vision Language Navigation

Add code
May 23, 2023
Figure 1 for Instance-Level Semantic Maps for Vision Language Navigation
Figure 2 for Instance-Level Semantic Maps for Vision Language Navigation
Figure 3 for Instance-Level Semantic Maps for Vision Language Navigation
Figure 4 for Instance-Level Semantic Maps for Vision Language Navigation
Viaarxiv icon

MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting

Add code
May 19, 2023
Figure 1 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Figure 2 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Figure 3 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Figure 4 for MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting
Viaarxiv icon

ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations

Add code
Mar 01, 2023
Figure 1 for ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations
Figure 2 for ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations
Figure 3 for ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations
Figure 4 for ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations
Viaarxiv icon

Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification

Add code
Feb 01, 2023
Viaarxiv icon

Ground then Navigate: Language-guided Navigation in Dynamic Scenes

Add code
Sep 24, 2022
Figure 1 for Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Figure 2 for Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Figure 3 for Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Figure 4 for Ground then Navigate: Language-guided Navigation in Dynamic Scenes
Viaarxiv icon

Grounding Linguistic Commands to Navigable Regions

Add code
Dec 24, 2021
Figure 1 for Grounding Linguistic Commands to Navigable Regions
Figure 2 for Grounding Linguistic Commands to Navigable Regions
Figure 3 for Grounding Linguistic Commands to Navigable Regions
Figure 4 for Grounding Linguistic Commands to Navigable Regions
Viaarxiv icon