Alert button

"speech recognition": models, code, and papers
Alert button

Incremental Learning for End-to-End Automatic Speech Recognition

May 11, 2020
Li Fu, Xiaoxiao Li, Libo Zi

Figure 1 for Incremental Learning for End-to-End Automatic Speech Recognition
Figure 2 for Incremental Learning for End-to-End Automatic Speech Recognition
Figure 3 for Incremental Learning for End-to-End Automatic Speech Recognition
Figure 4 for Incremental Learning for End-to-End Automatic Speech Recognition
Viaarxiv icon

Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments

Add code
Bookmark button
Alert button
Nov 15, 2017
Ziteng Wang, Emmanuel Vincent, Romain Serizel, Yonghong Yan

Figure 1 for Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments
Figure 2 for Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments
Figure 3 for Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments
Figure 4 for Rank-1 Constrained Multichannel Wiener Filter for Speech Recognition in Noisy Environments
Viaarxiv icon

Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition

Nov 03, 2020
Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong

Figure 1 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 2 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 3 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Figure 4 for Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition
Viaarxiv icon

FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition

Sep 15, 2021
Bonaventure F. P. Dossou, Yeno K. S. Gbenou

Figure 1 for FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition
Figure 2 for FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition
Figure 3 for FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition
Figure 4 for FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition
Viaarxiv icon

Compute Cost Amortized Transformer for Streaming ASR

Jul 05, 2022
Yi Xie, Jonathan Macoskey, Martin Radfar, Feng-Ju Chang, Brian King, Ariya Rastrow, Athanasios Mouchtaris, Grant P. Strimel

Figure 1 for Compute Cost Amortized Transformer for Streaming ASR
Figure 2 for Compute Cost Amortized Transformer for Streaming ASR
Figure 3 for Compute Cost Amortized Transformer for Streaming ASR
Figure 4 for Compute Cost Amortized Transformer for Streaming ASR
Viaarxiv icon

An Overview of Hindi Speech Recognition

May 09, 2013
Neema Mishra, Urmila Shrawankar, V M Thakare

Figure 1 for An Overview of Hindi Speech Recognition
Figure 2 for An Overview of Hindi Speech Recognition
Figure 3 for An Overview of Hindi Speech Recognition
Figure 4 for An Overview of Hindi Speech Recognition
Viaarxiv icon

Transformer-Transducer: End-to-End Speech Recognition with Self-Attention

Oct 28, 2019
Ching-Feng Yeh, Jay Mahadeokar, Kaustubh Kalgaonkar, Yongqiang Wang, Duc Le, Mahaveer Jain, Kjell Schubert, Christian Fuegen, Michael L. Seltzer

Figure 1 for Transformer-Transducer: End-to-End Speech Recognition with Self-Attention
Figure 2 for Transformer-Transducer: End-to-End Speech Recognition with Self-Attention
Figure 3 for Transformer-Transducer: End-to-End Speech Recognition with Self-Attention
Figure 4 for Transformer-Transducer: End-to-End Speech Recognition with Self-Attention
Viaarxiv icon

Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition

Jul 09, 2019
Pingchuan Ma, Stavros Petridis, Maja Pantic

Figure 1 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 2 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 3 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 4 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Viaarxiv icon

Multi-task Recurrent Model for True Multilingual Speech Recognition

Sep 27, 2016
Zhiyuan Tang, Lantian Li, Dong Wang

Figure 1 for Multi-task Recurrent Model for True Multilingual Speech Recognition
Figure 2 for Multi-task Recurrent Model for True Multilingual Speech Recognition
Figure 3 for Multi-task Recurrent Model for True Multilingual Speech Recognition
Figure 4 for Multi-task Recurrent Model for True Multilingual Speech Recognition
Viaarxiv icon

RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition

Feb 19, 2020
Peiyan Dong, Siyue Wang, Wei Niu, Chengming Zhang, Sheng Lin, Zhengang Li, Yifan Gong, Bin Ren, Xue Lin, Yanzhi Wang, Dingwen Tao

Figure 1 for RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
Figure 2 for RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
Figure 3 for RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
Figure 4 for RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
Viaarxiv icon