Alert button

"speech recognition": models, code, and papers
Alert button

Adversarial Attacks on ASR Systems: An Overview

Aug 03, 2022
Xiao Zhang, Hao Tan, Xuan Huang, Denghui Zhang, Keke Tang, Zhaoquan Gu

Figure 1 for Adversarial Attacks on ASR Systems: An Overview
Figure 2 for Adversarial Attacks on ASR Systems: An Overview
Figure 3 for Adversarial Attacks on ASR Systems: An Overview
Viaarxiv icon

Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition

Jun 01, 2020
Sanket Shah, Basil Abraham, Gurunath Reddy M, Sunayana Sitaram, Vikas Joshi

Figure 1 for Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition
Figure 2 for Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition
Figure 3 for Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition
Figure 4 for Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition
Viaarxiv icon

Exploring linguistic feature and model combination for speech recognition based automatic AD detection

Jun 28, 2022
Yi Wang, Tianzi Wang, Zi Ye, Lingwei Meng, Shoukang Hu, Xixin Wu, Xunying Liu, Helen Meng

Figure 1 for Exploring linguistic feature and model combination for speech recognition based automatic AD detection
Figure 2 for Exploring linguistic feature and model combination for speech recognition based automatic AD detection
Figure 3 for Exploring linguistic feature and model combination for speech recognition based automatic AD detection
Figure 4 for Exploring linguistic feature and model combination for speech recognition based automatic AD detection
Viaarxiv icon

Transfer Learning based Speech Affect Recognition in Urdu

Mar 05, 2021
Sara Durrani, Muhammad Umair Arshad

Figure 1 for Transfer Learning based Speech Affect Recognition in Urdu
Figure 2 for Transfer Learning based Speech Affect Recognition in Urdu
Figure 3 for Transfer Learning based Speech Affect Recognition in Urdu
Figure 4 for Transfer Learning based Speech Affect Recognition in Urdu
Viaarxiv icon

Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment

Add code
Bookmark button
Alert button
May 06, 2022
Yuan Gong, Ziyi Chen, Iek-Heng Chu, Peng Chang, James Glass

Figure 1 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 2 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 3 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Figure 4 for Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment
Viaarxiv icon

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

Add code
Bookmark button
Alert button
Feb 20, 2023
Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman

Figure 1 for VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Figure 2 for VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Figure 3 for VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Figure 4 for VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Viaarxiv icon

Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding

Apr 11, 2022
Sanjana Sankar, Denis Beautemps, Thomas Hueber

Figure 1 for Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
Figure 2 for Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
Figure 3 for Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding
Viaarxiv icon

Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning

May 27, 2022
Xiliang Zhu, Shayna Gardiner, David Rossouw, Tere Roldán, Simon Corston-Oliver

Figure 1 for Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning
Figure 2 for Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning
Figure 3 for Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning
Figure 4 for Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning
Viaarxiv icon

cross-modal fusion techniques for utterance-level emotion recognition from text and speech

Feb 05, 2023
Jiachen Luo, Huy Phan, Joshua Reiss

Figure 1 for cross-modal fusion techniques for utterance-level emotion recognition from text and speech
Figure 2 for cross-modal fusion techniques for utterance-level emotion recognition from text and speech
Figure 3 for cross-modal fusion techniques for utterance-level emotion recognition from text and speech
Figure 4 for cross-modal fusion techniques for utterance-level emotion recognition from text and speech
Viaarxiv icon

Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition

Jun 11, 2019
Pingchuan Ma, Stavros Petridis, Maja Pantic

Figure 1 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 2 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 3 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 4 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Viaarxiv icon