Alert button

"speech recognition": models, code, and papers
Alert button

Avoid Overthinking in Self-Supervised Models for Speech Recognition

Add code
Bookmark button
Alert button
Nov 01, 2022
Dan Berrebbi, Brian Yan, Shinji Watanabe

Figure 1 for Avoid Overthinking in Self-Supervised Models for Speech Recognition
Figure 2 for Avoid Overthinking in Self-Supervised Models for Speech Recognition
Figure 3 for Avoid Overthinking in Self-Supervised Models for Speech Recognition
Figure 4 for Avoid Overthinking in Self-Supervised Models for Speech Recognition
Viaarxiv icon

Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics

Add code
Bookmark button
Alert button
Jun 06, 2023
Bo Molenaar, Cristian Tejedor-Garcia, Helmer Strik, Catia Cucchiarini

Figure 1 for Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics
Figure 2 for Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics
Figure 3 for Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics
Figure 4 for Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics
Viaarxiv icon

Language-specific Characteristic Assistance for Code-switching Speech Recognition

Add code
Bookmark button
Alert button
Jul 05, 2022
Tongtong Song, Qiang Xu, Meng Ge, Longbiao Wang, Hao Shi, Yongjie Lv, Yuqin Lin, Jianwu Dang

Figure 1 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Figure 2 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Figure 3 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Figure 4 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Viaarxiv icon

Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers

May 09, 2023
Grant P. Strimel, Yi Xie, Brian King, Martin Radfar, Ariya Rastrow, Athanasios Mouchtaris

Figure 1 for Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers
Figure 2 for Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers
Figure 3 for Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers
Figure 4 for Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers
Viaarxiv icon

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

Add code
Bookmark button
Alert button
May 25, 2023
Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei

Figure 1 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Figure 2 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Figure 3 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Figure 4 for VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation
Viaarxiv icon

Audio-Visual Speech Enhancement with Score-Based Generative Models

Jun 02, 2023
Julius Richter, Simone Frintrop, Timo Gerkmann

Figure 1 for Audio-Visual Speech Enhancement with Score-Based Generative Models
Figure 2 for Audio-Visual Speech Enhancement with Score-Based Generative Models
Figure 3 for Audio-Visual Speech Enhancement with Score-Based Generative Models
Figure 4 for Audio-Visual Speech Enhancement with Score-Based Generative Models
Viaarxiv icon

Writer adaptation for offline text recognition: An exploration of neural network-based methods

Add code
Bookmark button
Alert button
Jul 11, 2023
Tobias van der Werff, Maruf A. Dhali, Lambert Schomaker

Figure 1 for Writer adaptation for offline text recognition: An exploration of neural network-based methods
Figure 2 for Writer adaptation for offline text recognition: An exploration of neural network-based methods
Figure 3 for Writer adaptation for offline text recognition: An exploration of neural network-based methods
Figure 4 for Writer adaptation for offline text recognition: An exploration of neural network-based methods
Viaarxiv icon

MobileASR: A resource-aware on-device personalisation framework for automatic speech recognition in mobile phones

Jun 15, 2023
Zitha Sasindran, Harsha Yelchuri, Pooja Rao, T. V. Prabhakar

Figure 1 for MobileASR: A resource-aware on-device personalisation framework for automatic speech recognition in mobile phones
Figure 2 for MobileASR: A resource-aware on-device personalisation framework for automatic speech recognition in mobile phones
Figure 3 for MobileASR: A resource-aware on-device personalisation framework for automatic speech recognition in mobile phones
Figure 4 for MobileASR: A resource-aware on-device personalisation framework for automatic speech recognition in mobile phones
Viaarxiv icon

Speech inpainting: Context-based speech synthesis guided by video

Add code
Bookmark button
Alert button
Jun 01, 2023
Juan F. Montesinos, Daniel Michelsanti, Gloria Haro, Zheng-Hua Tan, Jesper Jensen

Figure 1 for Speech inpainting: Context-based speech synthesis guided by video
Figure 2 for Speech inpainting: Context-based speech synthesis guided by video
Figure 3 for Speech inpainting: Context-based speech synthesis guided by video
Figure 4 for Speech inpainting: Context-based speech synthesis guided by video
Viaarxiv icon

Label Aware Speech Representation Learning For Language Identification

Jun 07, 2023
Shikhar Vashishth, Shikhar Bharadwaj, Sriram Ganapathy, Ankur Bapna, Min Ma, Wei Han, Vera Axelrod, Partha Talukdar

Figure 1 for Label Aware Speech Representation Learning For Language Identification
Figure 2 for Label Aware Speech Representation Learning For Language Identification
Figure 3 for Label Aware Speech Representation Learning For Language Identification
Figure 4 for Label Aware Speech Representation Learning For Language Identification
Viaarxiv icon