Musan


Noise-Robust AV-ASR Using Visual Features Both in the Whisper Encoder and Decoder

Add code
Jan 26, 2026
Viaarxiv icon

Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection

Add code
Sep 08, 2024
Figure 1 for Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection
Figure 2 for Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection
Figure 3 for Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection
Figure 4 for Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection
Viaarxiv icon

On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition

Add code
May 25, 2023
Figure 1 for On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition
Figure 2 for On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition
Figure 3 for On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition
Viaarxiv icon

LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge

Add code
Dec 19, 2019
Figure 1 for LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge
Figure 2 for LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge
Figure 3 for LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge
Figure 4 for LSTM-TDNN with convolutional front-end for Dialect Identification in the 2019 Multi-Genre Broadcast Challenge
Viaarxiv icon

SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation

Add code
Dec 01, 2018
Figure 1 for SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation
Figure 2 for SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation
Figure 3 for SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation
Figure 4 for SwishNet: A Fast Convolutional Neural Network for Speech, Music and Noise Classification and Segmentation
Viaarxiv icon