"speech recognition": models, code, and papers

Accented Speech Recognition Inspired by Human Perception

Apr 09, 2021
Xiangyun Chu, Elizabeth Combs, Amber Wang, Michael Picheny

Macro-block dropout for improved regularization in training end-to-end speech recognition models

Dec 29, 2022
Chanwoo Kim, Sathish Indurti, Jinhwan Park, Wonyong Sung

Privacy attacks for automatic speech recognition acoustic models in a federated learning framework

Nov 06, 2021
Natalia Tomashenko, Salima Mdhaffar, Marc Tommasi, Yannick Estève, Jean-François Bonastre

Retrieving Speaker Information from Personalized Acoustic Models for Speech Recognition

Nov 07, 2021
Salima Mdhaffar, Jean-François Bonastre, Marc Tommasi, Natalia Tomashenko, Yannick Estève

Improved disentangled speech representations using contrastive learning in factorized hierarchical variational autoencoder

Nov 15, 2022
Yuying Xie, Thomas Arildsen, Zheng-Hua Tan

AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation

Dec 17, 2022
Xingshan Zeng, Liangyou Li, Qun Liu

Enabling On-Device Training of Speech Recognition Models with Federated Dropout

Oct 07, 2021
Dhruv Guliani, Lillian Zhou, Changwan Ryu, Tien-Ju Yang, Harry Zhang, Yonghui Xiao, Francoise Beaufays, Giovanni Motta

Neural Architecture Search for Speech Recognition

Jul 27, 2020
Shoukang Hu, Xurong Xie, Shansong Liu, Mengzhe Geng, Xunying Liu, Helen Meng

Better Transcription of UK Supreme Court Hearings

Dec 22, 2022
Hadeel Saadany, Catherine Breslin, Constantin Orăsan, Sophie Walker

Audio-visual multi-channel speech separation, dereverberation and recognition

Apr 08, 2022
Guinan Li, Jianwei Yu, Jiajun Deng, Xunying Liu, Helen Meng
