Alert button

"speech recognition": models, code, and papers
Alert button

Isolated and Ensemble Audio Preprocessing Methods for Detecting Adversarial Examples against Automatic Speech Recognition

Sep 11, 2018
Krishan Rajaratnam, Kunal Shah, Jugal Kalita

Figure 1 for Isolated and Ensemble Audio Preprocessing Methods for Detecting Adversarial Examples against Automatic Speech Recognition
Figure 2 for Isolated and Ensemble Audio Preprocessing Methods for Detecting Adversarial Examples against Automatic Speech Recognition
Figure 3 for Isolated and Ensemble Audio Preprocessing Methods for Detecting Adversarial Examples against Automatic Speech Recognition
Figure 4 for Isolated and Ensemble Audio Preprocessing Methods for Detecting Adversarial Examples against Automatic Speech Recognition
Viaarxiv icon

Comparison and Analysis of New Curriculum Criteria for End-to-End ASR

Add code
Bookmark button
Alert button
Aug 10, 2022
Georgios Karakasidis, Tamás Grósz, Mikko Kurimo

Figure 1 for Comparison and Analysis of New Curriculum Criteria for End-to-End ASR
Figure 2 for Comparison and Analysis of New Curriculum Criteria for End-to-End ASR
Figure 3 for Comparison and Analysis of New Curriculum Criteria for End-to-End ASR
Figure 4 for Comparison and Analysis of New Curriculum Criteria for End-to-End ASR
Viaarxiv icon

Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer

Jul 29, 2022
Cong-Thanh Do, Mohan Li, Rama Doddipatla

Figure 1 for Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer
Figure 2 for Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer
Figure 3 for Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer
Figure 4 for Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer
Viaarxiv icon

Resolution limits on visual speech recognition

Oct 03, 2017
Helen L. Bear, Richard Harvey, Barry-John Theobald, Yuxuan Lan

Figure 1 for Resolution limits on visual speech recognition
Figure 2 for Resolution limits on visual speech recognition
Figure 3 for Resolution limits on visual speech recognition
Figure 4 for Resolution limits on visual speech recognition
Viaarxiv icon

Recurrent Deep Stacking Networks for Speech Recognition

Dec 14, 2016
Peidong Wang, Zhongqiu Wang, Deliang Wang

Figure 1 for Recurrent Deep Stacking Networks for Speech Recognition
Figure 2 for Recurrent Deep Stacking Networks for Speech Recognition
Viaarxiv icon

End-to-End Multimodal Speech Recognition

Apr 25, 2018
Shruti Palaskar, Ramon Sanabria, Florian Metze

Figure 1 for End-to-End Multimodal Speech Recognition
Viaarxiv icon

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context

May 09, 2020
Wei Han, Zhengdong Zhang, Yu Zhang, Jiahui Yu, Chung-Cheng Chiu, James Qin, Anmol Gulati, Ruoming Pang, Yonghui Wu

Figure 1 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Figure 2 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Figure 3 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Figure 4 for ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context
Viaarxiv icon

The Marchex 2018 English Conversational Telephone Speech Recognition System

Nov 05, 2018
Seongjun Hahm, Iroro Orife, Shane Walker, Jason Flaks

Figure 1 for The Marchex 2018 English Conversational Telephone Speech Recognition System
Figure 2 for The Marchex 2018 English Conversational Telephone Speech Recognition System
Figure 3 for The Marchex 2018 English Conversational Telephone Speech Recognition System
Figure 4 for The Marchex 2018 English Conversational Telephone Speech Recognition System
Viaarxiv icon

Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition

Add code
Bookmark button
Alert button
Jul 10, 2018
Chun-Fu Chen, Quanfu Fan, Neil Mallinar, Tom Sercu, Rogerio Feris

Figure 1 for Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Figure 2 for Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Figure 3 for Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Figure 4 for Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Viaarxiv icon