"speech recognition": models, code, and papers

Robust Over-the-Air Adversarial Examples Against Automatic Speech Recognition Systems

Aug 05, 2019
Lea Schönherr, Steffen Zeiler, Thorsten Holz, Dorothea Kolossa

Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models

Feb 26, 2022
Samuel Thomas, Brian Kingsbury, George Saon, Hong-Kwang J. Kuo

Pre-training on high-resource speech recognition improves low-resource speech-to-text translation

Sep 05, 2018
Sameer Bansal, Herman Kamper, Karen Livescu, Adam Lopez, Sharon Goldwater

Improved low-resource Somali speech recognition by semi-supervised acoustic and language model training

Jul 06, 2019
Astik Biswas, Raghav Menon, Ewald van der Westhuizen, Thomas Niesler

Adversarial Black-Box Attacks for Automatic Speech Recognition Systems Using Multi-Objective Genetic Optimization

Nov 04, 2018
Shreya Khare, Rahul Aralikatte, Senthil Mani

Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding

Feb 10, 2022
Peter Sullivan, Toshiko Shibano, Muhammad Abdul-Mageed

The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge

Feb 04, 2022
Naijun Zheng, Na Li, Xixin Wu, Lingwei Meng, Jiawen Kang, Haibin Wu, Chao Weng, Dan Su, Helen Meng

Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition

Jun 20, 2018
Titouan Parcollet, Ying Zhang, Mohamed Morchid, Chiheb Trabelsi, Georges Linarès, Renato De Mori, Yoshua Bengio

Twitter Dataset on the Russo-Ukrainian War

Apr 07, 2022
Alexander Shevtsov, Christos Tzagkarakis, Despoina Antonakaki, Polyvios Pratikakis, Sotiris Ioannidis

A Multi-Task Learning Framework for Overcoming the Catastrophic Forgetting in Automatic Speech Recognition

Apr 17, 2019
Jiabin Xue, Jiqing Han, Tieran Zheng, Xiang Gao, Jiaxing Guo
