"speech recognition": models, code, and papers

SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition

Oct 08, 2021
Li Fu, Xiaoxiao Li, Runyu Wang, Zhengchen Zhang, Youzheng Wu, Xiaodong He, Bowen Zhou

Via arXiv

Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention

Oct 23, 2020
Menglong Xu, Shengqiang Li, Xiao-Lei Zhang

Via arXiv

On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech

Jun 18, 2021
Katrin Tomanek, Françoise Beaufays, Julie Cattiau, Angad Chandorkar, Khe Chai Sim

Via arXiv

The Multilingual TEDx Corpus for Speech Recognition and Translation

Feb 02, 2021
Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post

Via arXiv

AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation

Dec 17, 2022
Xingshan Zeng, Liangyou Li, Qun Liu

Via arXiv

Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition

Dec 15, 2021
Kenichi Kumatani, Robert Gmyr, Felipe Cruz Salinas, Linquan Liu, Wei Zuo, Devang Patel, Eric Sun, Yu Shi

Via arXiv

Better Transcription of UK Supreme Court Hearings

Dec 22, 2022
Hadeel Saadany, Catherine Breslin, Constantin Orăsan, Sophie Walker

Via arXiv

Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of Any Number of Speakers

Jun 19, 2020
Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka

Via arXiv

Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI

Dec 05, 2021
Jinchuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou

Via arXiv

Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-Switching

Dec 19, 2021
Chia-Yu Li, Ngoc Thang Vu

Via arXiv