"speech recognition": models, code, and papers

Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think

Jun 15, 2023
Tina Raissi, Christoph Lüscher, Moritz Gunz, Ralf Schlüter, Hermann Ney

Via arXiv

Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

Nov 02, 2022
Lester Phillip Violeta, Ding Ma, Wen-Chin Huang, Tomoki Toda

Via arXiv

Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics

Jun 06, 2023
Bo Molenaar, Cristian Tejedor-Garcia, Helmer Strik, Catia Cucchiarini

Via arXiv

Fast Entropy-Based Methods of Word-Level Confidence Estimation for End-To-End Automatic Speech Recognition

Dec 16, 2022
Aleksandr Laptev, Boris Ginsburg

Via arXiv

Insights on Neural Representations for End-to-End Speech Recognition

May 19, 2022
Anna Ollerenshaw, Md Asif Jalal, Thomas Hain

Via arXiv

Avoid Overthinking in Self-Supervised Models for Speech Recognition

Nov 01, 2022
Dan Berrebbi, Brian Yan, Shinji Watanabe

Via arXiv

Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers

May 09, 2023
Grant P. Strimel, Yi Xie, Brian King, Martin Radfar, Ariya Rastrow, Athanasios Mouchtaris

Via arXiv

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

May 25, 2023
Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei

Via arXiv

Audio-Visual Speech Enhancement with Score-Based Generative Models

Jun 02, 2023
Julius Richter, Simone Frintrop, Timo Gerkmann

Via arXiv

Writer adaptation for offline text recognition: An exploration of neural network-based methods

Jul 11, 2023
Tobias van der Werff, Maruf A. Dhali, Lambert Schomaker

Via arXiv