Alert button

"speech recognition": models, code, and papers
Alert button

Integrated Semantic and Phonetic Post-correction for Chinese Speech Recognition

Add code
Bookmark button
Alert button
Nov 16, 2021
Yi-Chang Chen, Chun-Yen Cheng, Chien-An Chen, Ming-Chieh Sung, Yi-Ren Yeh

Viaarxiv icon

Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Mar 01, 2023
Philipp Klumpp, Pooja Chitkara, Leda Sarı, Prashant Serai, Jilong Wu, Irina-Elena Veliche, Rongqing Huang, Qing He

Figure 1 for Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition
Figure 2 for Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition
Figure 3 for Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition
Viaarxiv icon

Deep Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition

Feb 17, 2023
Yan Zhao, Jincen Wang, Yuan Zong, Wenming Zheng, Hailun Lian, Li Zhao

Figure 1 for Deep Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition
Figure 2 for Deep Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition
Figure 3 for Deep Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition
Figure 4 for Deep Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition
Viaarxiv icon

Multilingual Word Error Rate Estimation: e-WER3

Apr 02, 2023
Shammur Absar Chowdhury, Ahmed Ali

Figure 1 for Multilingual Word Error Rate Estimation: e-WER3
Figure 2 for Multilingual Word Error Rate Estimation: e-WER3
Figure 3 for Multilingual Word Error Rate Estimation: e-WER3
Figure 4 for Multilingual Word Error Rate Estimation: e-WER3
Viaarxiv icon

Security and Privacy Problems in Voice Assistant Applications: A Survey

Apr 19, 2023
Jingjin Li, Chao chen, Lei Pan, Mostafa Rahimi Azghadi, Hossein Ghodosi, Jun Zhang

Figure 1 for Security and Privacy Problems in Voice Assistant Applications: A Survey
Figure 2 for Security and Privacy Problems in Voice Assistant Applications: A Survey
Figure 3 for Security and Privacy Problems in Voice Assistant Applications: A Survey
Figure 4 for Security and Privacy Problems in Voice Assistant Applications: A Survey
Viaarxiv icon

Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts

Add code
Bookmark button
Alert button
Sep 25, 2021
Raluca Alexandra Fetic, Mikkel Jordahn, Lucas Chaves Lima, Rasmus Arpe Fogh Egebæk, Martin Carsten Nielsen, Benjamin Biering, Lars Kai Hansen

Figure 1 for Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts
Figure 2 for Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts
Figure 3 for Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts
Figure 4 for Topic Model Robustness to Automatic Speech Recognition Errors in Podcast Transcripts
Viaarxiv icon

Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition

Add code
Bookmark button
Alert button
May 09, 2022
Catalin Zorila, Rama Doddipatla

Figure 1 for Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Figure 2 for Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Figure 3 for Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Figure 4 for Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition
Viaarxiv icon

TransAudio: Towards the Transferable Adversarial Audio Attack via Learning Contextualized Perturbations

Add code
Bookmark button
Alert button
Mar 28, 2023
Qi Gege, Yuefeng Chen, Xiaofeng Mao, Yao Zhu, Binyuan Hui, Xiaodan Li, Rong Zhang, Hui Xue

Figure 1 for TransAudio: Towards the Transferable Adversarial Audio Attack via Learning Contextualized Perturbations
Figure 2 for TransAudio: Towards the Transferable Adversarial Audio Attack via Learning Contextualized Perturbations
Figure 3 for TransAudio: Towards the Transferable Adversarial Audio Attack via Learning Contextualized Perturbations
Figure 4 for TransAudio: Towards the Transferable Adversarial Audio Attack via Learning Contextualized Perturbations
Viaarxiv icon

Dynamic Chuck Convolution For Unified Streaming And Non-streaming Conformer ASR

Apr 18, 2023
Xilai Li, Goeric Huybrechts, Srikanth Ronanki, Jeff Farris, Sravan Bodapati

Figure 1 for Dynamic Chuck Convolution For Unified Streaming And Non-streaming Conformer ASR
Figure 2 for Dynamic Chuck Convolution For Unified Streaming And Non-streaming Conformer ASR
Figure 3 for Dynamic Chuck Convolution For Unified Streaming And Non-streaming Conformer ASR
Figure 4 for Dynamic Chuck Convolution For Unified Streaming And Non-streaming Conformer ASR
Viaarxiv icon

Low latency transformers for speech processing

Feb 27, 2023
Jianbo Ma, Siqi Pan, Deepak Chandran, Andrea Fanelli, Richard Cartwright

Figure 1 for Low latency transformers for speech processing
Figure 2 for Low latency transformers for speech processing
Figure 3 for Low latency transformers for speech processing
Figure 4 for Low latency transformers for speech processing
Viaarxiv icon