"speech recognition": models, code, and papers

Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition

Oct 27, 2022
Steven Vander Eeckt, Hugo Van hamme

Via arXiv

HuBERT-TR: Reviving Turkish Automatic Speech Recognition with Self-supervised Speech Representation Learning

Oct 13, 2022
Ali Safaya, Engin Erzin

Via arXiv

Multilingual Transformer Language Model for Speech Recognition in Low-resource Languages

Sep 08, 2022
Li Miao, Jian Wu, Piyush Behre, Shuangyu Chang, Sarangarajan Parthasarathy

Via arXiv

The History of Speech Recognition to the Year 2030

Jul 30, 2021
Awni Hannun

Via arXiv

Unit-based Speech-to-Speech Translation Without Parallel Data

May 24, 2023
Anuj Diwan, Anirudh Srinivasan, David Harwath, Eunsol Choi

Via arXiv

Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation

May 24, 2023
Rui-Chen Zheng, Yang Ai, Zhen-Hua Ling

Via arXiv

The Makerere Radio Speech Corpus: A Luganda Radio Corpus for Automatic Speech Recognition

Jun 20, 2022
Jonathan Mukiibi, Andrew Katumba, Joyce Nakatumba-Nabende, Ali Hussein, Josh Meyer

Via arXiv

Exploring the Role of Audio in Video Captioning

Jun 21, 2023
Yuhan Shen, Linjie Yang, Longyin Wen, Haichao Yu, Ehsan Elhamifar, Heng Wang

Via arXiv

Computing Optimal Location of Microphone for Improved Speech Recognition

Mar 24, 2022
Karan Nathwani, Bhavya Dixit, Sunil Kumar Kopparapu

Via arXiv

Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition

Oct 27, 2022
Yujin Wang, Changli Tang, Ziyang Ma, Zhisheng Zheng, Xie Chen, Wei-Qiang Zhang

Via arXiv