Jahangir Alam

United we stand, Divided we fall: Handling Weak Complementary Relationships for Audio-Visual Emotion Recognition in Valence-Arousal Space

Mar 21, 2025

Handling Weak Complementary Relationships for Audio-Visual Emotion Recognition

Mar 15, 2025

Inconsistency-Aware Cross-Attention for Audio-Visual Fusion in Dimensional Emotion Recognition

May 21, 2024

Recursive Joint Cross-Modal Attention for Multimodal Fusion in Dimensional Emotion Recognition

Mar 30, 2024

Cross-Attention is Not Always Needed: Dynamic Cross-Attention for Audio-Visual Dimensional Emotion Recognition

Mar 28, 2024

Audio-Visual Person Verification based on Recursive Fusion of Joint Cross-Attention

Mar 12, 2024

Dynamic Cross Attention for Audio-Visual Person Verification

Mar 12, 2024

Audio-Visual Speaker Verification via Joint Cross-Attention

Sep 28, 2023

Attentive activation function for improving end-to-end spoofing countermeasure systems

May 03, 2022

Robust Speech Representation Learning via Flow-based Embedding Regularization

Dec 07, 2021