Alert button

"speech recognition": models, code, and papers
Alert button

Toward Cross-Domain Speech Recognition with End-to-End Models

Mar 09, 2020
Thai-Son Nguyen, Sebastian Stüker, Alex Waibel

Figure 1 for Toward Cross-Domain Speech Recognition with End-to-End Models
Figure 2 for Toward Cross-Domain Speech Recognition with End-to-End Models
Figure 3 for Toward Cross-Domain Speech Recognition with End-to-End Models
Figure 4 for Toward Cross-Domain Speech Recognition with End-to-End Models
Viaarxiv icon

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition

Jun 16, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori

Figure 1 for Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Figure 2 for Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Viaarxiv icon

Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers

Dec 07, 2022
Zijian Yang, Wei Zhou, Ralf Schlüter, Hermann Ney

Figure 1 for Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers
Figure 2 for Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers
Figure 3 for Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers
Viaarxiv icon

Evaluating context-invariance in unsupervised speech representations

Oct 27, 2022
Mark Hallap, Emmanuel Dupoux, Ewan Dunbar

Figure 1 for Evaluating context-invariance in unsupervised speech representations
Figure 2 for Evaluating context-invariance in unsupervised speech representations
Figure 3 for Evaluating context-invariance in unsupervised speech representations
Figure 4 for Evaluating context-invariance in unsupervised speech representations
Viaarxiv icon

Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition

Jan 14, 2022
Mengzhe Geng, Shansong Liu, Jianwei Yu, Xurong Xie, Shoukang Hu, Zi Ye, Zengrui Jin, Xunying Liu, Helen Meng

Figure 1 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 2 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 3 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Figure 4 for Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
Viaarxiv icon

Training Neural Speech Recognition Systems with Synthetic Speech Augmentation

Add code
Bookmark button
Alert button
Nov 02, 2018
Jason Li, Ravi Gadde, Boris Ginsburg, Vitaly Lavrukhin

Figure 1 for Training Neural Speech Recognition Systems with Synthetic Speech Augmentation
Figure 2 for Training Neural Speech Recognition Systems with Synthetic Speech Augmentation
Figure 3 for Training Neural Speech Recognition Systems with Synthetic Speech Augmentation
Figure 4 for Training Neural Speech Recognition Systems with Synthetic Speech Augmentation
Viaarxiv icon

Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge

Add code
Bookmark button
Alert button
Jul 29, 2022
Alef Iury Siqueira Ferreira, Gustavo dos Reis Oliveira

Figure 1 for Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Figure 2 for Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Figure 3 for Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Figure 4 for Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge
Viaarxiv icon

BUT Opensat 2019 Speech Recognition System

Jan 30, 2020
Martin Karafiát, Murali Karthick Baskar, Igor Szöke, Hari Krishna Vydana, Karel Veselý, Jan "Honza'' Černocký

Figure 1 for BUT Opensat 2019 Speech Recognition System
Figure 2 for BUT Opensat 2019 Speech Recognition System
Figure 3 for BUT Opensat 2019 Speech Recognition System
Figure 4 for BUT Opensat 2019 Speech Recognition System
Viaarxiv icon

Learning to Rank Microphones for Distant Speech Recognition

Add code
Bookmark button
Alert button
Apr 13, 2021
Samuele Cornell, Alessio Brutti, Marco Matassoni, Stefano Squartini

Figure 1 for Learning to Rank Microphones for Distant Speech Recognition
Figure 2 for Learning to Rank Microphones for Distant Speech Recognition
Figure 3 for Learning to Rank Microphones for Distant Speech Recognition
Figure 4 for Learning to Rank Microphones for Distant Speech Recognition
Viaarxiv icon

Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT

Add code
Bookmark button
Alert button
Feb 15, 2021
Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang

Figure 1 for Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Figure 2 for Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Figure 3 for Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Figure 4 for Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Viaarxiv icon