"speech recognition": models, code, and papers

Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning

Oct 27, 2022
Qiu-Shi Zhu, Long Zhou, Jie Zhang, Shu-Jie Liu, Yu-Chen Hu, Li-Rong Dai

Visual Speech Recognition

Sep 03, 2014
Ahmad B. A. Hassanat

Multi-Classifier Interactive Learning for Ambiguous Speech Emotion Recognition

Dec 12, 2020
Ying Zhou, Xuefeng Liang, Yu Gu, Yifei Yin, Longshan Yao

Dynamic Sparsity Neural Networks for Automatic Speech Recognition

May 16, 2020
Zhaofeng Wu, Ding Zhao, Qiao Liang, Jiahui Yu, Anmol Gulati, Ruoming Pang

SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition

Apr 18, 2019
Daniel S. Park, William Chan, Yu Zhang, Chung-Cheng Chiu, Barret Zoph, Ekin D. Cubuk, Quoc V. Le

Self-Attention Networks for Connectionist Temporal Classification in Speech Recognition

Jan 22, 2019
Julian Salazar, Katrin Kirchhoff, Zhiheng Huang

Speech Emotion Recognition using Self-Supervised Features

Feb 07, 2022
Edmilson Morais, Ron Hoory, Weizhong Zhu, Itai Gat, Matheus Damasceno, Hagai Aronowitz

Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech Recognition

May 19, 2020
Yan Gao, Titouan Parcollet, Nicholas Lane

Bayesian Recurrent Units and the Forward-Backward Algorithm

Jul 21, 2022
Alexandre Bittar, Philip N. Garner

Acoustic-to-articulatory Speech Inversion with Multi-task Learning

May 27, 2022
Yashish M. Siriwardena, Ganesh Sivaraman, Carol Espy-Wilson
