Picture for Jianhua Tao

Jianhua Tao

HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition

Add code
Jan 11, 2024
Figure 1 for HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
Figure 2 for HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
Figure 3 for HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
Figure 4 for HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition
Viaarxiv icon

SVFAP: Self-supervised Video Facial Affect Perceiver

Add code
Dec 31, 2023
Figure 1 for SVFAP: Self-supervised Video Facial Affect Perceiver
Figure 2 for SVFAP: Self-supervised Video Facial Affect Perceiver
Figure 3 for SVFAP: Self-supervised Video Facial Affect Perceiver
Figure 4 for SVFAP: Self-supervised Video Facial Affect Perceiver
Viaarxiv icon

What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection

Add code
Dec 15, 2023
Viaarxiv icon

GPT-4V with Emotion: A Zero-shot Benchmark for Multimodal Emotion Understanding

Add code
Dec 07, 2023
Figure 1 for GPT-4V with Emotion: A Zero-shot Benchmark for Multimodal Emotion Understanding
Figure 2 for GPT-4V with Emotion: A Zero-shot Benchmark for Multimodal Emotion Understanding
Figure 3 for GPT-4V with Emotion: A Zero-shot Benchmark for Multimodal Emotion Understanding
Figure 4 for GPT-4V with Emotion: A Zero-shot Benchmark for Multimodal Emotion Understanding
Viaarxiv icon

Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection

Add code
Oct 13, 2023
Figure 1 for Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection
Figure 2 for Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection
Figure 3 for Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection
Figure 4 for Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection
Viaarxiv icon

Controllable Residual Speaker Representation for Voice Conversion

Add code
Sep 15, 2023
Viaarxiv icon

Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms

Add code
Sep 13, 2023
Figure 1 for Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms
Figure 2 for Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms
Figure 3 for Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms
Figure 4 for Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms
Viaarxiv icon

DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection

Add code
Sep 07, 2023
Figure 1 for DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection
Figure 2 for DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection
Figure 3 for DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection
Figure 4 for DGSD: Dynamical Graph Self-Distillation for EEG-Based Auditory Spatial Attention Detection
Viaarxiv icon

Audio Deepfake Detection: A Survey

Add code
Aug 29, 2023
Figure 1 for Audio Deepfake Detection: A Survey
Figure 2 for Audio Deepfake Detection: A Survey
Figure 3 for Audio Deepfake Detection: A Survey
Figure 4 for Audio Deepfake Detection: A Survey
Viaarxiv icon

Spatial Reconstructed Local Attention Res2Net with F0 Subband for Fake Speech Detection

Add code
Aug 19, 2023
Viaarxiv icon