"speech": models, code, and papers

Censer: Curriculum Semi-supervised Learning for Speech Recognition Based on Self-supervised Pre-training

Jun 16, 2022
Bowen Zhang, Songjun Cao, Xiaoming Zhang, Yike Zhang, Long Ma, Takahiro Shinozaki

Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition

Mar 31, 2022
Anirudh Gupta, Rishabh Gaur, Ankur Dhuriya, Harveen Singh Chadha, Neeraj Chhimwal, Priyanshi Shah, Vivek Raghavan

Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation

Apr 19, 2022
Keqi Deng, Shinji Watanabe, Jiatong Shi, Siddhant Arora

data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup

Nov 02, 2022
Vasista Sai Lodagala, Sreyan Ghosh, S. Umesh

SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training

Oct 20, 2021
Ankur Bapna, Yu-an Chung, Nan Wu, Anmol Gulati, Ye Jia, Jonathan H. Clark, Melvin Johnson, Jason Riesa, Alexis Conneau, Yu Zhang

Self-critical Sequence Training for Automatic Speech Recognition

Apr 13, 2022
Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng

Pushing the performances of ASR models on English and Spanish accents

Dec 22, 2022
Pooja Chitkara, Morgane Riviere, Jade Copet, Frank Zhang, Yatharth Saraf

Everything is Connected: Graph Neural Networks

Jan 19, 2023
Petar Veličković

Equivariant and Steerable Neural Networks: A review with special emphasis on the symmetric group

Jan 08, 2023
Patrick Krüger, Hanno Gottschalk

MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient

Mar 16, 2022
Andong Li, Chengshi Zheng, Ziyang Zhang, Xiaodong Li
