Alert button

"speech recognition": models, code, and papers
Alert button

Finnish Parliament ASR corpus - Analysis, benchmarks and statistics

Mar 28, 2022
Anja Virkkunen, Aku Rouhe, Nhan Phan, Mikko Kurimo

Figure 1 for Finnish Parliament ASR corpus - Analysis, benchmarks and statistics
Figure 2 for Finnish Parliament ASR corpus - Analysis, benchmarks and statistics
Figure 3 for Finnish Parliament ASR corpus - Analysis, benchmarks and statistics
Figure 4 for Finnish Parliament ASR corpus - Analysis, benchmarks and statistics
Viaarxiv icon

Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR

Jan 26, 2022
Yufei Liu, Rao Ma, Haihua Xu, Yi He, Zejun Ma, Weibin Zhang

Figure 1 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Figure 2 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Figure 3 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Figure 4 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Viaarxiv icon

Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0

Sep 27, 2022
Bagus Tris Atmaja, Akira Sasou

Figure 1 for Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0
Figure 2 for Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0
Figure 3 for Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0
Figure 4 for Predicting Affective Vocal Bursts with Finetuned wav2vec 2.0
Viaarxiv icon

NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy

Feb 11, 2022
Yash Mehta, Colin White, Arber Zela, Arjun Krishnakumar, Guri Zabergja, Shakiba Moradian, Mahmoud Safari, Kaicheng Yu, Frank Hutter

Figure 1 for NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy
Figure 2 for NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy
Figure 3 for NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy
Figure 4 for NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy
Viaarxiv icon

Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling

Mar 13, 2019
Peidong Wang, Ke Tan, DeLiang Wang

Figure 1 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Figure 2 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Figure 3 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Figure 4 for Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling
Viaarxiv icon

Improving RNN-T ASR Performance with Date-Time and Location Awareness

Jun 16, 2021
Swayambhu Nath Ray, Soumyajit Mitra, Raghavendra Bilgi, Sri Garimella

Figure 1 for Improving RNN-T ASR Performance with Date-Time and Location Awareness
Figure 2 for Improving RNN-T ASR Performance with Date-Time and Location Awareness
Figure 3 for Improving RNN-T ASR Performance with Date-Time and Location Awareness
Figure 4 for Improving RNN-T ASR Performance with Date-Time and Location Awareness
Viaarxiv icon

Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition

Sep 08, 2015
Mortaza Doulaty, Oscar Saz, Thomas Hain

Figure 1 for Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition
Figure 2 for Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition
Figure 3 for Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition
Figure 4 for Unsupervised Domain Discovery using Latent Dirichlet Allocation for Acoustic Modelling in Speech Recognition
Viaarxiv icon

Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation

Feb 23, 2021
Richeng Duan, Nancy F. Chen

Figure 1 for Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation
Figure 2 for Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation
Figure 3 for Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation
Figure 4 for Senone-aware Adversarial Multi-task Training for Unsupervised Child to Adult Speech Adaptation
Viaarxiv icon

Recognition of Isolated Words using Zernike and MFCC features for Audio Visual Speech Recognition

Jul 04, 2014
Prashant Bordea, Amarsinh Varpeb, Ramesh Manzac, Pravin Yannawara

Figure 1 for Recognition of Isolated Words using Zernike and MFCC features for Audio Visual Speech Recognition
Figure 2 for Recognition of Isolated Words using Zernike and MFCC features for Audio Visual Speech Recognition
Figure 3 for Recognition of Isolated Words using Zernike and MFCC features for Audio Visual Speech Recognition
Figure 4 for Recognition of Isolated Words using Zernike and MFCC features for Audio Visual Speech Recognition
Viaarxiv icon

Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions

Jan 03, 2022
Marwan Omar, Soohyeon Choi, DaeHun Nyang, David Mohaisen

Figure 1 for Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions
Figure 2 for Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions
Figure 3 for Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions
Figure 4 for Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions
Viaarxiv icon