Alert button

"speech recognition": models, code, and papers
Alert button

Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models

Feb 15, 2023
Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, Luis Chiruzzo, John E. Ortega, Gustavo A. Giménez-Lugo, Rolando Coto-Solano, Katharina Kann

Figure 1 for Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Figure 2 for Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Figure 3 for Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Figure 4 for Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Viaarxiv icon

A Language Agnostic Multilingual Streaming On-Device ASR System

Aug 29, 2022
Bo Li, Tara N. Sainath, Ruoming Pang, Shuo-yiin Chang, Qiumin Xu, Trevor Strohman, Vince Chen, Qiao Liang, Heguang Liu, Yanzhang He, Parisa Haghani, Sameer Bidichandani

Figure 1 for A Language Agnostic Multilingual Streaming On-Device ASR System
Figure 2 for A Language Agnostic Multilingual Streaming On-Device ASR System
Figure 3 for A Language Agnostic Multilingual Streaming On-Device ASR System
Figure 4 for A Language Agnostic Multilingual Streaming On-Device ASR System
Viaarxiv icon

Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition

Jun 09, 2020
Gurunath Reddy Madhumani, Sanket Shah, Basil Abraham, Vikas Joshi, Sunayana Sitaram

Figure 1 for Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition
Figure 2 for Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition
Figure 3 for Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition
Figure 4 for Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition
Viaarxiv icon

Batch-normalized joint training for DNN-based distant speech recognition

Mar 24, 2017
Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

Figure 1 for Batch-normalized joint training for DNN-based distant speech recognition
Figure 2 for Batch-normalized joint training for DNN-based distant speech recognition
Figure 3 for Batch-normalized joint training for DNN-based distant speech recognition
Figure 4 for Batch-normalized joint training for DNN-based distant speech recognition
Viaarxiv icon

Vakyansh: ASR Toolkit for Low Resource Indic languages

Mar 30, 2022
Harveen Singh Chadha, Anirudh Gupta, Priyanshi Shah, Neeraj Chhimwal, Ankur Dhuriya, Rishabh Gaur, Vivek Raghavan

Figure 1 for Vakyansh: ASR Toolkit for Low Resource Indic languages
Figure 2 for Vakyansh: ASR Toolkit for Low Resource Indic languages
Figure 3 for Vakyansh: ASR Toolkit for Low Resource Indic languages
Figure 4 for Vakyansh: ASR Toolkit for Low Resource Indic languages
Viaarxiv icon

English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System

May 09, 2021
Guillermo Cámbara, Alex Peiró-Lilja, Mireia Farrús, Jordi Luque

Figure 1 for English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System
Figure 2 for English Accent Accuracy Analysis in a State-of-the-Art Automatic Speech Recognition System
Viaarxiv icon

Out-of-Distribution Representation Learning for Time Series Classification

Sep 26, 2022
Wang Lu, Jindong Wang, Xinwei Sun, Yiqiang Chen, Xing Xie

Figure 1 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 2 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 3 for Out-of-Distribution Representation Learning for Time Series Classification
Figure 4 for Out-of-Distribution Representation Learning for Time Series Classification
Viaarxiv icon

Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition

Oct 07, 2021
Arya Aftab, Alireza Morsali, Shahrokh Ghaemmaghami, Benoit Champagne

Figure 1 for Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
Figure 2 for Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
Figure 3 for Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
Figure 4 for Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
Viaarxiv icon

cif-based collaborative decoding for end-to-end contextual speech recognition

Dec 17, 2020
Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu

Figure 1 for cif-based collaborative decoding for end-to-end contextual speech recognition
Figure 2 for cif-based collaborative decoding for end-to-end contextual speech recognition
Figure 3 for cif-based collaborative decoding for end-to-end contextual speech recognition
Figure 4 for cif-based collaborative decoding for end-to-end contextual speech recognition
Viaarxiv icon

Distribution Aware Metrics for Conditional Natural Language Generation

Sep 29, 2022
David M Chan, Yiming Ni, David A Ross, Sudheendra Vijayanarasimhan, Austin Myers, John Canny

Figure 1 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 2 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 3 for Distribution Aware Metrics for Conditional Natural Language Generation
Figure 4 for Distribution Aware Metrics for Conditional Natural Language Generation
Viaarxiv icon