Alert button

"speech recognition": models, code, and papers
Alert button

Audio Visual Speech Recognition using Deep Recurrent Neural Networks

Nov 09, 2016
Abhinav Thanda, Shankar M Venkatesan

Figure 1 for Audio Visual Speech Recognition using Deep Recurrent Neural Networks
Figure 2 for Audio Visual Speech Recognition using Deep Recurrent Neural Networks
Figure 3 for Audio Visual Speech Recognition using Deep Recurrent Neural Networks
Figure 4 for Audio Visual Speech Recognition using Deep Recurrent Neural Networks
Viaarxiv icon

Multitask Learning with CTC and Segmental CRF for Speech Recognition

Jun 05, 2017
Liang Lu, Lingpeng Kong, Chris Dyer, Noah A. Smith

Figure 1 for Multitask Learning with CTC and Segmental CRF for Speech Recognition
Figure 2 for Multitask Learning with CTC and Segmental CRF for Speech Recognition
Figure 3 for Multitask Learning with CTC and Segmental CRF for Speech Recognition
Figure 4 for Multitask Learning with CTC and Segmental CRF for Speech Recognition
Viaarxiv icon

A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database

Dec 08, 2019
Hossein Zeinali, Lukáš Burget, Jan "Honza'' Černocký

Figure 1 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Figure 2 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Figure 3 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Figure 4 for A Multi Purpose and Large Scale Speech Corpus in Persian and English for Speaker and Speech Recognition: the DeepMine Database
Viaarxiv icon

CTC Variations Through New WFST Topologies

Oct 06, 2021
Aleksandr Laptev, Somshubra Majumdar, Boris Ginsburg

Figure 1 for CTC Variations Through New WFST Topologies
Figure 2 for CTC Variations Through New WFST Topologies
Figure 3 for CTC Variations Through New WFST Topologies
Figure 4 for CTC Variations Through New WFST Topologies
Viaarxiv icon

Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition

Jun 14, 2017
Liang Lu, Steve Renals

Figure 1 for Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition
Figure 2 for Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition
Figure 3 for Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition
Figure 4 for Small-footprint Deep Neural Networks with Highway Connections for Speech Recognition
Viaarxiv icon

SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems

Jul 16, 2020
Hadi Abdullah, Kevin Warren, Vincent Bindschaedler, Nicolas Papernot, Patrick Traynor

Figure 1 for SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems
Figure 2 for SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems
Figure 3 for SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems
Figure 4 for SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems
Viaarxiv icon

Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech

Oct 04, 2021
Ying Qin, Wei Liu, Zhiyuan Peng, Si-Ioi Ng, Jingyu Li, Haibo Hu, Tan Lee

Figure 1 for Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech
Figure 2 for Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech
Figure 3 for Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech
Figure 4 for Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech
Viaarxiv icon

State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions

Oct 01, 2019
Kyu J. Han, Ramon Prieto, Kaixing Wu, Tao Ma

Figure 1 for State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions
Figure 2 for State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions
Figure 3 for State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions
Figure 4 for State-of-the-Art Speech Recognition Using Multi-Stream Self-Attention With Dilated 1D Convolutions
Viaarxiv icon

Deep transfer learning for partial differential equations under conditional shift with DeepONet

Apr 20, 2022
Somdatta Goswami, Katiana Kontolati, Michael D. Shields, George Em Karniadakis

Figure 1 for Deep transfer learning for partial differential equations under conditional shift with DeepONet
Figure 2 for Deep transfer learning for partial differential equations under conditional shift with DeepONet
Figure 3 for Deep transfer learning for partial differential equations under conditional shift with DeepONet
Figure 4 for Deep transfer learning for partial differential equations under conditional shift with DeepONet
Viaarxiv icon

Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks

Mar 08, 2013
Dong Yu, Michael L. Seltzer, Jinyu Li, Jui-Ting Huang, Frank Seide

Figure 1 for Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks
Figure 2 for Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks
Figure 3 for Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks
Figure 4 for Feature Learning in Deep Neural Networks - Studies on Speech Recognition Tasks
Viaarxiv icon