Alert button

"speech recognition": models, code, and papers
Alert button

Improving Punctuation Restoration for Speech Transcripts via External Data

Oct 01, 2021
Xue-Yong Fu, Cheng Chen, Md Tahmid Rahman Laskar, Shashi Bhushan TN, Simon Corston-Oliver

Figure 1 for Improving Punctuation Restoration for Speech Transcripts via External Data
Figure 2 for Improving Punctuation Restoration for Speech Transcripts via External Data
Figure 3 for Improving Punctuation Restoration for Speech Transcripts via External Data
Figure 4 for Improving Punctuation Restoration for Speech Transcripts via External Data
Viaarxiv icon

Wav2Vec2.0 on the Edge: Performance Evaluation

Feb 12, 2022
Santosh Gondi

Figure 1 for Wav2Vec2.0 on the Edge: Performance Evaluation
Figure 2 for Wav2Vec2.0 on the Edge: Performance Evaluation
Figure 3 for Wav2Vec2.0 on the Edge: Performance Evaluation
Figure 4 for Wav2Vec2.0 on the Edge: Performance Evaluation
Viaarxiv icon

Towards Identity Preserving Normal to Dysarthric Voice Conversion

Add code
Bookmark button
Alert button
Oct 15, 2021
Wen-Chin Huang, Bence Mark Halpern, Lester Phillip Violeta, Odette Scharenborg, Tomoki Toda

Figure 1 for Towards Identity Preserving Normal to Dysarthric Voice Conversion
Figure 2 for Towards Identity Preserving Normal to Dysarthric Voice Conversion
Figure 3 for Towards Identity Preserving Normal to Dysarthric Voice Conversion
Figure 4 for Towards Identity Preserving Normal to Dysarthric Voice Conversion
Viaarxiv icon

End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results

Add code
Bookmark button
Alert button
Dec 04, 2014
Jan Chorowski, Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio

Figure 1 for End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results
Figure 2 for End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results
Figure 3 for End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results
Figure 4 for End-to-end Continuous Speech Recognition using Attention-based Recurrent NN: First Results
Viaarxiv icon

Bayesian Transformer Language Models for Speech Recognition

Feb 09, 2021
Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng

Figure 1 for Bayesian Transformer Language Models for Speech Recognition
Figure 2 for Bayesian Transformer Language Models for Speech Recognition
Figure 3 for Bayesian Transformer Language Models for Speech Recognition
Figure 4 for Bayesian Transformer Language Models for Speech Recognition
Viaarxiv icon

Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech

Sep 14, 2021
Katrin Tomanek, Vicky Zayats, Dirk Padfield, Kara Vaillancourt, Fadi Biadsy

Figure 1 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Figure 2 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Figure 3 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Figure 4 for Residual Adapters for Parameter-Efficient ASR Adaptation to Atypical and Accented Speech
Viaarxiv icon

Continuous Speech Separation with Recurrent Selective Attention Network

Oct 28, 2021
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li

Figure 1 for Continuous Speech Separation with Recurrent Selective Attention Network
Figure 2 for Continuous Speech Separation with Recurrent Selective Attention Network
Figure 3 for Continuous Speech Separation with Recurrent Selective Attention Network
Figure 4 for Continuous Speech Separation with Recurrent Selective Attention Network
Viaarxiv icon

SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning

Oct 18, 2022
Zuheng Kang, Jianzong Wang, Junqing Peng, Jing Xiao

Figure 1 for SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Figure 2 for SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Figure 3 for SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Figure 4 for SVLDL: Improved Speaker Age Estimation Using Selective Variance Label Distribution Learning
Viaarxiv icon

LightSeq2: Accelerated Training for Transformer-based Models on GPUs

Add code
Bookmark button
Alert button
Oct 27, 2021
Xiaohui Wang, Ying Xiong, Xian Qian, Yang Wei, Lei Li, Mingxuan Wang

Figure 1 for LightSeq2: Accelerated Training for Transformer-based Models on GPUs
Figure 2 for LightSeq2: Accelerated Training for Transformer-based Models on GPUs
Figure 3 for LightSeq2: Accelerated Training for Transformer-based Models on GPUs
Figure 4 for LightSeq2: Accelerated Training for Transformer-based Models on GPUs
Viaarxiv icon

SpliceOut: A Simple and Efficient Audio Augmentation Method

Add code
Bookmark button
Alert button
Oct 13, 2021
Arjit Jain, Pranay Reddy Samala, Deepak Mittal, Preethi Jyoti, Maneesh Singh

Figure 1 for SpliceOut: A Simple and Efficient Audio Augmentation Method
Figure 2 for SpliceOut: A Simple and Efficient Audio Augmentation Method
Figure 3 for SpliceOut: A Simple and Efficient Audio Augmentation Method
Figure 4 for SpliceOut: A Simple and Efficient Audio Augmentation Method
Viaarxiv icon