Alert button

"speech recognition": models, code, and papers
Alert button

Exploring Transformers for Large-Scale Speech Recognition

May 19, 2020
Liang Lu, Changliang Liu, Jinyu Li, Yifan Gong

Figure 1 for Exploring Transformers for Large-Scale Speech Recognition
Figure 2 for Exploring Transformers for Large-Scale Speech Recognition
Figure 3 for Exploring Transformers for Large-Scale Speech Recognition
Figure 4 for Exploring Transformers for Large-Scale Speech Recognition
Viaarxiv icon

Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric

Oct 11, 2021
Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer

Figure 1 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Figure 2 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Figure 3 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Figure 4 for Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Viaarxiv icon

Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder

Add code
Bookmark button
Alert button
Nov 15, 2022
Yuying Xie, Thomas Arildsen, Zheng-Hua Tan

Figure 1 for Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder
Figure 2 for Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder
Figure 3 for Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder
Figure 4 for Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder
Viaarxiv icon

On the limit of English conversational speech recognition

May 03, 2021
Zoltán Tüske, George Saon, Brian Kingsbury

Figure 1 for On the limit of English conversational speech recognition
Figure 2 for On the limit of English conversational speech recognition
Figure 3 for On the limit of English conversational speech recognition
Viaarxiv icon

SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition

Oct 08, 2021
Li Fu, Xiaoxiao Li, Runyu Wang, Zhengchen Zhang, Youzheng Wu, Xiaodong He, Bowen Zhou

Figure 1 for SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition
Figure 2 for SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition
Figure 3 for SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition
Figure 4 for SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition
Viaarxiv icon

Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention

Add code
Bookmark button
Alert button
Oct 23, 2020
Menglong Xu, Shengqiang Li, Xiao-Lei Zhang

Figure 1 for Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Figure 2 for Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Figure 3 for Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Figure 4 for Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention
Viaarxiv icon

AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation

Add code
Bookmark button
Alert button
Dec 17, 2022
Xingshan Zeng, Liangyou Li, Qun Liu

Figure 1 for AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Figure 2 for AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Figure 3 for AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Figure 4 for AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation
Viaarxiv icon

On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech

Jun 18, 2021
Katrin Tomanek, Françoise Beaufays, Julie Cattiau, Angad Chandorkar, Khe Chai Sim

Figure 1 for On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech
Figure 2 for On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech
Figure 3 for On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech
Figure 4 for On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech
Viaarxiv icon

Better Transcription of UK Supreme Court Hearings

Dec 22, 2022
Hadeel Saadany, Catherine Breslin, Constantin Orăsan, Sophie Walker

Figure 1 for Better Transcription of UK Supreme Court Hearings
Figure 2 for Better Transcription of UK Supreme Court Hearings
Figure 3 for Better Transcription of UK Supreme Court Hearings
Figure 4 for Better Transcription of UK Supreme Court Hearings
Viaarxiv icon

The Multilingual TEDx Corpus for Speech Recognition and Translation

Add code
Bookmark button
Alert button
Feb 02, 2021
Elizabeth Salesky, Matthew Wiesner, Jacob Bremerman, Roldano Cattoni, Matteo Negri, Marco Turchi, Douglas W. Oard, Matt Post

Figure 1 for The Multilingual TEDx Corpus for Speech Recognition and Translation
Figure 2 for The Multilingual TEDx Corpus for Speech Recognition and Translation
Figure 3 for The Multilingual TEDx Corpus for Speech Recognition and Translation
Figure 4 for The Multilingual TEDx Corpus for Speech Recognition and Translation
Viaarxiv icon