"speech recognition": models, code, and papers

Training for Speech Recognition on Coprocessors

Mar 22, 2020
Sebastian Baunsgaard, Sebastian B. Wrede, Pınar Tozun

Phoneme-based speech recognition for commanding the robotic Arm

Jan 25, 2020
Adwait P Naik

Improving Data Driven Inverse Text Normalization using Data Augmentation

Jul 20, 2022
Laxmi Pandey, Debjyoti Paul, Pooja Chitkara, Yutong Pang, Xuedong Zhang, Kjell Schubert, Mark Chou, Shu Liu, Yatharth Saraf

Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning

Jan 11, 2019
Ladislav Mošner, Minhua Wu, Anirudh Raju, Sree Hari Krishnan Parthasarathi, Kenichi Kumatani, Shiva Sundaram, Roland Maas, Björn Hoffmeister

Improving Readability for Automatic Speech Recognition Transcription

Apr 09, 2020
Junwei Liao, Sefik Emre Eskimez, Liyang Lu, Yu Shi, Ming Gong, Linjun Shou, Hong Qu, Michael Zeng

Distribution Aware Metrics for Conditional Natural Language Generation

Sep 15, 2022
David M Chan, Yiming Ni, Austin Myers, Sudheendra Vijayanarasimhan, David A Ross, John Canny

Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models

Feb 15, 2023
Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, Luis Chiruzzo, John E. Ortega, Gustavo A. Giménez-Lugo, Rolando Coto-Solano, Katharina Kann

Recurrent Neural Network Transducer for Audio-Visual Speech Recognition

Nov 08, 2019
Takaki Makino, Hank Liao, Yannis Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan

End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks

Oct 07, 2021
Navin Raj Prabhu, Guillaume Carbajal, Nale Lehmann-Willenbrock, Timo Gerkmann

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

Mar 06, 2023
Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman
