Shah Nawaz

An Effective Training Framework for Light-Weight Automatic Speech Recognition Models

May 22, 2025
Via arXiv

PAEFF: Precise Alignment and Enhanced Gated Feature Fusion for Face-Voice Association

May 22, 2025
Via arXiv

A Multimodal Single-Branch Embedding Network for Recommendation in Cold-Start and Missing Modality Scenarios

Sep 26, 2024
Via arXiv

Modality Invariant Multimodal Learning to Handle Missing Modalities: A Single-Branch Approach

Aug 14, 2024
Via arXiv

Chameleon: Images Are What You Need For Multimodal Learning Robust To Missing Modalities

Jul 23, 2024
Via arXiv

Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan

Apr 16, 2024
Via arXiv

Frame-to-Utterance Convergence: A Spectra-Temporal Approach for Unified Spoofing Detection

Sep 18, 2023
Via arXiv

DCTM: Dilated Convolutional Transformer Model for Multimodal Engagement Estimation in Conversation

Jul 31, 2023
Via arXiv

Single-branch Network for Multimodal Training

Mar 10, 2023
Via arXiv

Speaker Recognition in Realistic Scenario Using Multimodal Data

Feb 25, 2023
Via arXiv