Sanath Narayan

Open-Vocabulary Temporal Action Localization using Multimodal Guidance

Jun 21, 2024
Via arXiv

Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning

Jun 06, 2024
Via arXiv

Multi-modal Generation via Cross-Modal In-Context Learning

May 28, 2024
Via arXiv

ViSpeR: Multilingual Audio-Visual Speech Recognition

May 27, 2024
Via arXiv

Do Vision and Language Encoders Represent the World Similarly?

Jan 10, 2024
Via arXiv

Do VSR Models Generalize Beyond LRS3?

Nov 23, 2023
Via arXiv

Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping

Aug 11, 2023
Via arXiv

Remote Sensing Change Detection With Transformers Trained from Scratch

Apr 13, 2023
Via arXiv

Cross-modulated Few-shot Image Generation for Colorectal Tissue Classification

Apr 04, 2023
Via arXiv

Video Instance Segmentation in an Open-World

Apr 03, 2023
Via arXiv