Alert button

"speech recognition": models, code, and papers
Alert button

End-to-End Neural Segmental Models for Speech Recognition

Aug 15, 2017
Hao Tang, Liang Lu, Lingpeng Kong, Kevin Gimpel, Karen Livescu, Chris Dyer, Noah A. Smith, Steve Renals

Figure 1 for End-to-End Neural Segmental Models for Speech Recognition
Figure 2 for End-to-End Neural Segmental Models for Speech Recognition
Figure 3 for End-to-End Neural Segmental Models for Speech Recognition
Figure 4 for End-to-End Neural Segmental Models for Speech Recognition
Viaarxiv icon

Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks

Add code
Bookmark button
Alert button
Jan 08, 2022
Shoukang Hu, Xurong Xie, Mingyu Cui, Jiajun Deng, Shansong Liu, Jianwei Yu, Mengzhe Geng, Xunying Liu, Helen Meng

Figure 1 for Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
Figure 2 for Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
Figure 3 for Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
Figure 4 for Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks
Viaarxiv icon

Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech

May 09, 2019
Tobias Menne, Ilya Sklyar, Ralf Schlüter, Hermann Ney

Figure 1 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Figure 2 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Figure 3 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Figure 4 for Analysis of Deep Clustering as Preprocessing for Automatic Speech Recognition of Sparsely Overlapping Speech
Viaarxiv icon

Multi-style Training for South African Call Centre Audio

Add code
Bookmark button
Alert button
Feb 15, 2022
Walter Heymans, Marelie H. Davel, Charl van Heerden

Viaarxiv icon

Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement

Nov 12, 2022
Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk

Figure 1 for Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement
Viaarxiv icon

Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models

Jul 01, 2019
Ke Hu, Antoine Bruguier, Tara N. Sainath, Rohit Prabhavalkar, Golan Pundak

Figure 1 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Figure 2 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Figure 3 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Figure 4 for Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models
Viaarxiv icon

Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages

Add code
Bookmark button
Alert button
Oct 31, 2020
Trideba Padhi, Astik Biswas, Febe De Wet, Ewald van der Westhuizen, Thomas Niesler

Figure 1 for Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages
Figure 2 for Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages
Figure 3 for Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages
Figure 4 for Multilingual Bottleneck Features for Improving ASR Performance of Code-Switched Speech in Under-Resourced Languages
Viaarxiv icon

Error Correction in ASR using Sequence-to-Sequence Models

Feb 02, 2022
Samrat Dutta, Shreyansh Jain, Ayush Maheshwari, Ganesh Ramakrishnan, Preethi Jyothi

Figure 1 for Error Correction in ASR using Sequence-to-Sequence Models
Figure 2 for Error Correction in ASR using Sequence-to-Sequence Models
Figure 3 for Error Correction in ASR using Sequence-to-Sequence Models
Figure 4 for Error Correction in ASR using Sequence-to-Sequence Models
Viaarxiv icon

Understanding Audio Features via Trainable Basis Functions

Add code
Bookmark button
Alert button
Apr 25, 2022
Kwan Yee Heung, Kin Wai Cheuk, Dorien Herremans

Figure 1 for Understanding Audio Features via Trainable Basis Functions
Figure 2 for Understanding Audio Features via Trainable Basis Functions
Figure 3 for Understanding Audio Features via Trainable Basis Functions
Figure 4 for Understanding Audio Features via Trainable Basis Functions
Viaarxiv icon

Automated speech tools for helping communities process restricted-access corpora for language revival efforts

Add code
Bookmark button
Alert button
Apr 24, 2022
Nay San, Martijn Bartelds, Tolúlopé Ògúnrèmí, Alison Mount, Ruben Thompson, Michael Higgins, Roy Barker, Jane Simpson, Dan Jurafsky

Figure 1 for Automated speech tools for helping communities process restricted-access corpora for language revival efforts
Figure 2 for Automated speech tools for helping communities process restricted-access corpora for language revival efforts
Figure 3 for Automated speech tools for helping communities process restricted-access corpora for language revival efforts
Figure 4 for Automated speech tools for helping communities process restricted-access corpora for language revival efforts
Viaarxiv icon