Alert button

"speech recognition": models, code, and papers
Alert button

Non-Parallel Voice Conversion for ASR Augmentation

Sep 15, 2022
Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Yinghui Huang, Jesse Emond, Pedro Moreno Mengibar

Figure 1 for Non-Parallel Voice Conversion for ASR Augmentation
Figure 2 for Non-Parallel Voice Conversion for ASR Augmentation
Figure 3 for Non-Parallel Voice Conversion for ASR Augmentation
Figure 4 for Non-Parallel Voice Conversion for ASR Augmentation
Viaarxiv icon

The THUEE System Description for the IARPA OpenASR21 Challenge

Add code
Bookmark button
Alert button
Jun 29, 2022
Jing Zhao, Haoyu Wang, Jinpeng Li, Shuzhou Chai, Guan-Bo Wang, Guoguo Chen, Wei-Qiang Zhang

Figure 1 for The THUEE System Description for the IARPA OpenASR21 Challenge
Figure 2 for The THUEE System Description for the IARPA OpenASR21 Challenge
Figure 3 for The THUEE System Description for the IARPA OpenASR21 Challenge
Figure 4 for The THUEE System Description for the IARPA OpenASR21 Challenge
Viaarxiv icon

Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding

Oct 16, 2022
Ruchao Fan, Guoli Ye, Yashesh Gaur, Jinyu Li

Figure 1 for Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding
Figure 2 for Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding
Figure 3 for Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding
Figure 4 for Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding
Viaarxiv icon

Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition

May 01, 2020
Hu Hu, Rui Zhao, Jinyu Li, Liang Lu, Yifan Gong

Figure 1 for Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Figure 2 for Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Figure 3 for Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Figure 4 for Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition
Viaarxiv icon

Speech Recognition: Keyword Spotting Through Image Recognition

Mar 10, 2018
Sanjay Krishna Gouda, Salil Kanetkar, David Harrison, Manfred K Warmuth

Figure 1 for Speech Recognition: Keyword Spotting Through Image Recognition
Figure 2 for Speech Recognition: Keyword Spotting Through Image Recognition
Figure 3 for Speech Recognition: Keyword Spotting Through Image Recognition
Figure 4 for Speech Recognition: Keyword Spotting Through Image Recognition
Viaarxiv icon

The Microsoft 2017 Conversational Speech Recognition System

Aug 24, 2017
W. Xiong, L. Wu, F. Alleva, J. Droppo, X. Huang, A. Stolcke

Figure 1 for The Microsoft 2017 Conversational Speech Recognition System
Figure 2 for The Microsoft 2017 Conversational Speech Recognition System
Figure 3 for The Microsoft 2017 Conversational Speech Recognition System
Viaarxiv icon

Attention-Based End-to-End Speech Recognition on Voice Search

Feb 13, 2018
Changhao Shan, Junbo Zhang, Yujun Wang, Lei Xie

Figure 1 for Attention-Based End-to-End Speech Recognition on Voice Search
Figure 2 for Attention-Based End-to-End Speech Recognition on Voice Search
Figure 3 for Attention-Based End-to-End Speech Recognition on Voice Search
Figure 4 for Attention-Based End-to-End Speech Recognition on Voice Search
Viaarxiv icon

Visual Speech Recognition

Add code
Bookmark button
Alert button
Sep 03, 2014
Ahmad B. A. Hassanat

Figure 1 for Visual Speech Recognition
Figure 2 for Visual Speech Recognition
Figure 3 for Visual Speech Recognition
Figure 4 for Visual Speech Recognition
Viaarxiv icon

Automatic context window composition for distant speech recognition

May 26, 2018
Mirco Ravanelli, Maurizio Omologo

Figure 1 for Automatic context window composition for distant speech recognition
Figure 2 for Automatic context window composition for distant speech recognition
Figure 3 for Automatic context window composition for distant speech recognition
Figure 4 for Automatic context window composition for distant speech recognition
Viaarxiv icon

Speech Recognition and Multi-Speaker Diarization of Long Conversations

Add code
Bookmark button
Alert button
May 16, 2020
Huanru Henry Mao, Shuyang Li, Julian McAuley, Garrison Cottrell

Figure 1 for Speech Recognition and Multi-Speaker Diarization of Long Conversations
Figure 2 for Speech Recognition and Multi-Speaker Diarization of Long Conversations
Figure 3 for Speech Recognition and Multi-Speaker Diarization of Long Conversations
Viaarxiv icon