Alert button

"speech recognition": models, code, and papers
Alert button

LRWR: Large-Scale Benchmark for Lip Reading in Russian language

Sep 14, 2021
Evgeniy Egorov, Vasily Kostyumov, Mikhail Konyk, Sergey Kolesnikov

Figure 1 for LRWR: Large-Scale Benchmark for Lip Reading in Russian language
Figure 2 for LRWR: Large-Scale Benchmark for Lip Reading in Russian language
Viaarxiv icon

The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments

Add code
Bookmark button
Alert button
Oct 06, 2017
Mirco Ravanelli, Maurizio Omologo

Figure 1 for The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
Figure 2 for The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
Figure 3 for The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
Figure 4 for The DIRHA-English corpus and related tasks for distant-speech recognition in domestic environments
Viaarxiv icon

An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition

Jul 22, 2021
Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao, Abeer Alwan

Figure 1 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Figure 2 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Figure 3 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Figure 4 for An Improved Single Step Non-autoregressive Transformer for Automatic Speech Recognition
Viaarxiv icon

Beyond $L_p$ clipping: Equalization-based Psychoacoustic Attacks against ASRs

Add code
Bookmark button
Alert button
Oct 25, 2021
Hadi Abdullah, Muhammad Sajidur Rahman, Christian Peeters, Cassidy Gibson, Washington Garcia, Vincent Bindschaedler, Thomas Shrimpton, Patrick Traynor

Figure 1 for Beyond $L_p$ clipping: Equalization-based Psychoacoustic Attacks against ASRs
Figure 2 for Beyond $L_p$ clipping: Equalization-based Psychoacoustic Attacks against ASRs
Figure 3 for Beyond $L_p$ clipping: Equalization-based Psychoacoustic Attacks against ASRs
Figure 4 for Beyond $L_p$ clipping: Equalization-based Psychoacoustic Attacks against ASRs
Viaarxiv icon

Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning

Add code
Bookmark button
Alert button
Oct 15, 2021
Toshiko Shibano, Xinyi Zhang, Mia Taige Li, Haejin Cho, Peter Sullivan, Muhammad Abdul-Mageed

Figure 1 for Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning
Figure 2 for Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning
Figure 3 for Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning
Figure 4 for Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning
Viaarxiv icon

Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model

Sep 06, 2021
Zhongwei Teng, Quchen Fu, Jules White, Maria Powell, Douglas C. Schmidt

Figure 1 for Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model
Figure 2 for Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model
Figure 3 for Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model
Figure 4 for Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model
Viaarxiv icon

Deep Residual Local Feature Learning for Speech Emotion Recognition

Nov 19, 2020
Sattaya Singkul, Thakorn Chatchaisathaporn, Boontawee Suntisrivaraporn, Kuntpong Woraratpanya

Figure 1 for Deep Residual Local Feature Learning for Speech Emotion Recognition
Figure 2 for Deep Residual Local Feature Learning for Speech Emotion Recognition
Figure 3 for Deep Residual Local Feature Learning for Speech Emotion Recognition
Figure 4 for Deep Residual Local Feature Learning for Speech Emotion Recognition
Viaarxiv icon

Building competitive direct acoustics-to-word models for English conversational speech recognition

Dec 08, 2017
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny

Figure 1 for Building competitive direct acoustics-to-word models for English conversational speech recognition
Figure 2 for Building competitive direct acoustics-to-word models for English conversational speech recognition
Figure 3 for Building competitive direct acoustics-to-word models for English conversational speech recognition
Figure 4 for Building competitive direct acoustics-to-word models for English conversational speech recognition
Viaarxiv icon

Deep Recurrent Convolutional Neural Network: Improving Performance For Speech Recognition

Dec 27, 2016
Zewang Zhang, Zheng Sun, Jiaqi Liu, Jingwen Chen, Zhao Huo, Xiao Zhang

Figure 1 for Deep Recurrent Convolutional Neural Network: Improving Performance For Speech Recognition
Figure 2 for Deep Recurrent Convolutional Neural Network: Improving Performance For Speech Recognition
Figure 3 for Deep Recurrent Convolutional Neural Network: Improving Performance For Speech Recognition
Figure 4 for Deep Recurrent Convolutional Neural Network: Improving Performance For Speech Recognition
Viaarxiv icon