"speech recognition": models, code, and papers

On the N-gram Approximation of Pre-trained Language Models

Jun 12, 2023
Aravind Krishnan, Jesujoba Alabi, Dietrich Klakow

Via arXiv

Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data

Jul 04, 2023
Guangzhi Sun, Chao Zhang, Ivan Vulić, Paweł Budzianowski, Philip C. Woodland

Via arXiv

End-to-End Joint Target and Non-Target Speakers ASR

Jun 04, 2023
Ryo Masumura, Naoki Makishima, Taiga Yamane, Yoshihiko Yamazaki, Saki Mizuno, Mana Ihori, Mihiro Uchida, Keita Suzuki, Hiroshi Sato, Tomohiro Tanaka, Akihiko Takashima, Satoshi Suzuki, Takafumi Moriya, Nobukatsu Hojo, Atsushi Ando

Via arXiv

Improving Noisy Student Training on Non-target Domain Data for Automatic Speech Recognition

Nov 09, 2022
Yu Chen, Wen Ding, Junjie Lai

Via arXiv

E-Branchformer: Branchformer with Enhanced merging for speech recognition

Sep 30, 2022
Kwangyoun Kim, Felix Wu, Yifan Peng, Jing Pan, Prashant Sridhar, Kyu J. Han, Shinji Watanabe

Via arXiv

The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results

Nov 03, 2022
Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu

Via arXiv

Competitive and Resource Efficient Factored Hybrid HMM Systems are Simpler Than You Think

Jun 15, 2023
Tina Raissi, Christoph Lüscher, Moritz Gunz, Ralf Schlüter, Hermann Ney

Via arXiv

Self-supervised Fine-tuning for Improved Content Representations by Speaker-invariant Clustering

May 18, 2023
Heng-Jui Chang, Alexander H. Liu, James Glass

Via arXiv

Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics

Jun 06, 2023
Bo Molenaar, Cristian Tejedor-Garcia, Helmer Strik, Catia Cucchiarini

Via arXiv

Writer adaptation for offline text recognition: An exploration of neural network-based methods

Jul 11, 2023
Tobias van der Werff, Maruf A. Dhali, Lambert Schomaker

Via arXiv