Alert button

"speech recognition": models, code, and papers
Alert button

Regeneration Learning: A Learning Paradigm for Data Generation

Jan 21, 2023
Xu Tan, Tao Qin, Jiang Bian, Tie-Yan Liu, Yoshua Bengio

Figure 1 for Regeneration Learning: A Learning Paradigm for Data Generation
Figure 2 for Regeneration Learning: A Learning Paradigm for Data Generation
Figure 3 for Regeneration Learning: A Learning Paradigm for Data Generation
Figure 4 for Regeneration Learning: A Learning Paradigm for Data Generation
Viaarxiv icon

Configurable Privacy-Preserving Automatic Speech Recognition

Add code
Bookmark button
Alert button
Apr 01, 2021
Ranya Aloufi, Hamed Haddadi, David Boyle

Figure 1 for Configurable Privacy-Preserving Automatic Speech Recognition
Figure 2 for Configurable Privacy-Preserving Automatic Speech Recognition
Figure 3 for Configurable Privacy-Preserving Automatic Speech Recognition
Figure 4 for Configurable Privacy-Preserving Automatic Speech Recognition
Viaarxiv icon

Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study

Add code
Bookmark button
Alert button
Mar 31, 2022
Keyu An, Zhijian Ou

Figure 1 for Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study
Figure 2 for Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study
Figure 3 for Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study
Figure 4 for Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study
Viaarxiv icon

Pronunciation Modeling of Foreign Words for Mandarin ASR by Considering the Effect of Language Transfer

Oct 07, 2022
Lei Wang, Rong Tong

Figure 1 for Pronunciation Modeling of Foreign Words for Mandarin ASR by Considering the Effect of Language Transfer
Figure 2 for Pronunciation Modeling of Foreign Words for Mandarin ASR by Considering the Effect of Language Transfer
Figure 3 for Pronunciation Modeling of Foreign Words for Mandarin ASR by Considering the Effect of Language Transfer
Figure 4 for Pronunciation Modeling of Foreign Words for Mandarin ASR by Considering the Effect of Language Transfer
Viaarxiv icon

Quantifying Bias in Automatic Speech Recognition

Add code
Bookmark button
Alert button
Apr 01, 2021
Siyuan Feng, Olya Kudina, Bence Mark Halpern, Odette Scharenborg

Figure 1 for Quantifying Bias in Automatic Speech Recognition
Figure 2 for Quantifying Bias in Automatic Speech Recognition
Figure 3 for Quantifying Bias in Automatic Speech Recognition
Figure 4 for Quantifying Bias in Automatic Speech Recognition
Viaarxiv icon

Conformer: Convolution-augmented Transformer for Speech Recognition

Add code
Bookmark button
Alert button
May 16, 2020
Anmol Gulati, James Qin, Chung-Cheng Chiu, Niki Parmar, Yu Zhang, Jiahui Yu, Wei Han, Shibo Wang, Zhengdong Zhang, Yonghui Wu, Ruoming Pang

Figure 1 for Conformer: Convolution-augmented Transformer for Speech Recognition
Figure 2 for Conformer: Convolution-augmented Transformer for Speech Recognition
Figure 3 for Conformer: Convolution-augmented Transformer for Speech Recognition
Figure 4 for Conformer: Convolution-augmented Transformer for Speech Recognition
Viaarxiv icon

Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence

Apr 18, 2023
Yicheng Hsu, Mingsian R. Bai

Figure 1 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Figure 2 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Figure 3 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Figure 4 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Viaarxiv icon

Data Augmentation based Consistency Contrastive Pre-training for Automatic Speech Recognition

Dec 23, 2021
Changfeng Gao, Gaofeng Cheng, Yifan Guo, Qingwei Zhao, Pengyuan Zhang

Figure 1 for Data Augmentation based Consistency Contrastive Pre-training for Automatic Speech Recognition
Figure 2 for Data Augmentation based Consistency Contrastive Pre-training for Automatic Speech Recognition
Figure 3 for Data Augmentation based Consistency Contrastive Pre-training for Automatic Speech Recognition
Figure 4 for Data Augmentation based Consistency Contrastive Pre-training for Automatic Speech Recognition
Viaarxiv icon

Attention based end to end Speech Recognition for Voice Search in Hindi and English

Nov 15, 2021
Raviraj Joshi, Venkateshan Kannan

Figure 1 for Attention based end to end Speech Recognition for Voice Search in Hindi and English
Figure 2 for Attention based end to end Speech Recognition for Voice Search in Hindi and English
Viaarxiv icon

Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems

Dec 03, 2021
Xiaoliang Wu, Ajitha Rajan

Figure 1 for Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems
Figure 2 for Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems
Figure 3 for Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems
Figure 4 for Blackbox Untargeted Adversarial Testing of Automatic Speech Recognition Systems
Viaarxiv icon