Alert button

"speech recognition": models, code, and papers
Alert button

Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition

May 11, 2020
Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang

Figure 1 for Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
Figure 2 for Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
Figure 3 for Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
Figure 4 for Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
Viaarxiv icon

Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss

Add code
Bookmark button
Alert button
Feb 14, 2020
Qian Zhang, Han Lu, Hasim Sak, Anshuman Tripathi, Erik McDermott, Stephen Koo, Shankar Kumar

Figure 1 for Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Figure 2 for Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Figure 3 for Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Figure 4 for Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss
Viaarxiv icon

Universal Fourier Attack for Time Series

Sep 02, 2022
Elizabeth Coda, Brad Clymer, Chance DeSmet, Yijing Watkins, Michael Girard

Figure 1 for Universal Fourier Attack for Time Series
Figure 2 for Universal Fourier Attack for Time Series
Figure 3 for Universal Fourier Attack for Time Series
Figure 4 for Universal Fourier Attack for Time Series
Viaarxiv icon

FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition

Add code
Bookmark button
Alert button
May 20, 2021
Yichong Leng, Xu Tan, Linchen Zhu, Jin Xu, Renqian Luo, Linquan Liu, Tao Qin, Xiang-Yang Li, Ed Lin, Tie-Yan Liu

Figure 1 for FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Figure 2 for FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Figure 3 for FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Figure 4 for FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Viaarxiv icon

A network of deep neural networks for distant speech recognition

Mar 23, 2017
Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

Figure 1 for A network of deep neural networks for distant speech recognition
Figure 2 for A network of deep neural networks for distant speech recognition
Figure 3 for A network of deep neural networks for distant speech recognition
Figure 4 for A network of deep neural networks for distant speech recognition
Viaarxiv icon

A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline

Add code
Bookmark button
Alert button
Sep 22, 2020
Yerbolat Khassanov, Saida Mussakhojayeva, Almas Mirzakhmetov, Alen Adiyev, Mukhamet Nurpeiissov, Huseyin Atakan Varol

Figure 1 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 2 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 3 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 4 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Viaarxiv icon

Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition

Jun 04, 2021
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong

Figure 1 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 2 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Figure 3 for Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
Viaarxiv icon

DNN-Based Semantic Model for Rescoring N-best Speech Recognition List

Nov 02, 2020
Dominique Fohr, Irina Illina

Figure 1 for DNN-Based Semantic Model for Rescoring N-best Speech Recognition List
Figure 2 for DNN-Based Semantic Model for Rescoring N-best Speech Recognition List
Figure 3 for DNN-Based Semantic Model for Rescoring N-best Speech Recognition List
Figure 4 for DNN-Based Semantic Model for Rescoring N-best Speech Recognition List
Viaarxiv icon

Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition

Nov 15, 2017
Shankar Kumar, Michael Nirschl, Daniel Holtmann-Rice, Hank Liao, Ananda Theertha Suresh, Felix Yu

Figure 1 for Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition
Figure 2 for Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition
Figure 3 for Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition
Figure 4 for Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition
Viaarxiv icon

Speech Enhancement Modeling Towards Robust Speech Recognition System

May 07, 2013
Urmila Shrawankar, V. M. Thakare

Viaarxiv icon