Alert button

"speech recognition": models, code, and papers
Alert button

Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language

Add code
Bookmark button
Alert button
Dec 14, 2022
Alexei Baevski, Arun Babu, Wei-Ning Hsu, Michael Auli

Figure 1 for Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language
Figure 2 for Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language
Figure 3 for Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language
Figure 4 for Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language
Viaarxiv icon

Convolutional Speech Recognition with Pitch and Voice Quality Features

Add code
Bookmark button
Alert button
Sep 02, 2020
Guillermo Cámbara, Jordi Luque, Mireia Farrús

Figure 1 for Convolutional Speech Recognition with Pitch and Voice Quality Features
Figure 2 for Convolutional Speech Recognition with Pitch and Voice Quality Features
Viaarxiv icon

Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features

Add code
Bookmark button
Alert button
May 23, 2023
Chenglong Wang, Jiangyan Yi, Jianhua Tao, Chuyuan Zhang, Shuai Zhang, Xun Chen

Figure 1 for Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features
Figure 2 for Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features
Figure 3 for Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features
Figure 4 for Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features
Viaarxiv icon

Federated Acoustic Modeling For Automatic Speech Recognition

Feb 08, 2021
Xiaodong Cui, Songtao Lu, Brian Kingsbury

Figure 1 for Federated Acoustic Modeling For Automatic Speech Recognition
Figure 2 for Federated Acoustic Modeling For Automatic Speech Recognition
Figure 3 for Federated Acoustic Modeling For Automatic Speech Recognition
Figure 4 for Federated Acoustic Modeling For Automatic Speech Recognition
Viaarxiv icon

Improving Transformer-based Speech Recognition Using Unsupervised Pre-training

Oct 22, 2019
Dongwei Jiang, Xiaoning Lei, Wubo Li, Ne Luo, Yuxuan Hu, Wei Zou, Xiangang Li

Figure 1 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Figure 2 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Figure 3 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Figure 4 for Improving Transformer-based Speech Recognition Using Unsupervised Pre-training
Viaarxiv icon

Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition

Add code
Bookmark button
Alert button
Sep 17, 2021
Guangzhi Sun, Chao Zhang, Philip C. Woodland

Figure 1 for Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition
Figure 2 for Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition
Figure 3 for Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition
Figure 4 for Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition
Viaarxiv icon

Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications

Jan 14, 2021
Yoo Rhee Oh, Kiyoung Park, Jeon Gyu Park

Figure 1 for Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications
Figure 2 for Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications
Figure 3 for Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications
Viaarxiv icon

StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition

Aug 10, 2021
Shoki Sakamoto, Akira Taniguchi, Tadahiro Taniguchi, Hirokazu Kameoka

Figure 1 for StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition
Figure 2 for StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition
Figure 3 for StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition
Figure 4 for StarGAN-VC+ASR: StarGAN-based Non-Parallel Voice Conversion Regularized by Automatic Speech Recognition
Viaarxiv icon

Training Autoregressive Speech Recognition Models with Limited in-domain Supervision

Oct 27, 2022
Chak-Fai Li, Francis Keith, William Hartmann, Matthew Snover

Figure 1 for Training Autoregressive Speech Recognition Models with Limited in-domain Supervision
Figure 2 for Training Autoregressive Speech Recognition Models with Limited in-domain Supervision
Figure 3 for Training Autoregressive Speech Recognition Models with Limited in-domain Supervision
Figure 4 for Training Autoregressive Speech Recognition Models with Limited in-domain Supervision
Viaarxiv icon

Memory-efficient Speech Recognition on Smart Devices

Feb 23, 2021
Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Michael L. Seltzer, Vikas Chandra

Figure 1 for Memory-efficient Speech Recognition on Smart Devices
Figure 2 for Memory-efficient Speech Recognition on Smart Devices
Figure 3 for Memory-efficient Speech Recognition on Smart Devices
Figure 4 for Memory-efficient Speech Recognition on Smart Devices
Viaarxiv icon