"speech recognition": models, code, and papers

TS-RIR: Translated synthetic room impulse responses for speech augmentation

Apr 03, 2021
Anton Ratnarajah, Zhenyu Tang, Dinesh Manocha

Lattention: Lattice-attention in ASR rescoring

Nov 19, 2021
Prabhat Pandey, Sergio Duarte Torres, Ali Orkan Bayer, Ankur Gandhe, Volker Leutnant

Conformer-based Hybrid ASR System for Switchboard Dataset

Nov 05, 2021
Mohammad Zeineldeen, Jingjing Xu, Christoph Lüscher, Wilfried Michel, Alexander Gerstenberger, Ralf Schlüter, Hermann Ney

Highway Long Short-Term Memory RNNs for Distant Speech Recognition

Jan 11, 2016
Yu Zhang, Guoguo Chen, Dong Yu, Kaisheng Yao, Sanjeev Khudanpur, James Glass

Ask2Mask: Guided Data Selection for Masked Speech Modeling

Feb 24, 2022
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro Moreno

Power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition

Dec 22, 2019
Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda

Use of Machine Learning Technique to maximize the signal over background for $H \rightarrow \tau\tau$

Jun 27, 2021
Kanhaiya Gupta

Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese

Jun 04, 2018
Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu

Are E2E ASR models ready for an industrial usage?

Dec 09, 2021
Valentin Vielzeuf, Grigory Antipov
