Alert button

"speech recognition": models, code, and papers
Alert button

RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition

Feb 19, 2020
Peiyan Dong, Siyue Wang, Wei Niu, Chengming Zhang, Sheng Lin, Zhengang Li, Yifan Gong, Bin Ren, Xue Lin, Yanzhi Wang, Dingwen Tao

Figure 1 for RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
Figure 2 for RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
Figure 3 for RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
Figure 4 for RTMobile: Beyond Real-Time Mobile Acceleration of RNNs for Speech Recognition
Viaarxiv icon

Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition

Jul 09, 2019
Pingchuan Ma, Stavros Petridis, Maja Pantic

Figure 1 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 2 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 3 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 4 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Viaarxiv icon

Adversarial synthesis based data-augmentation for code-switched spoken language identification

May 30, 2022
Parth Shastri, Chirag Patil, Poorval Wanere, Dr. Shrinivas Mahajan, Dr. Abhishek Bhatt, Dr. Hardik Sailor

Figure 1 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Figure 2 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Figure 3 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Figure 4 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Viaarxiv icon

Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Oct 26, 2020
Chao-Han Huck Yang, Jun Qi, Samuel Yen-Chi Chen, Pin-Yu Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee

Figure 1 for Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition
Figure 2 for Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition
Figure 3 for Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition
Figure 4 for Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition
Viaarxiv icon

Multi-task Recurrent Model for True Multilingual Speech Recognition

Sep 27, 2016
Zhiyuan Tang, Lantian Li, Dong Wang

Figure 1 for Multi-task Recurrent Model for True Multilingual Speech Recognition
Figure 2 for Multi-task Recurrent Model for True Multilingual Speech Recognition
Figure 3 for Multi-task Recurrent Model for True Multilingual Speech Recognition
Figure 4 for Multi-task Recurrent Model for True Multilingual Speech Recognition
Viaarxiv icon

An Overview of Hindi Speech Recognition

May 09, 2013
Neema Mishra, Urmila Shrawankar, V M Thakare

Figure 1 for An Overview of Hindi Speech Recognition
Figure 2 for An Overview of Hindi Speech Recognition
Figure 3 for An Overview of Hindi Speech Recognition
Figure 4 for An Overview of Hindi Speech Recognition
Viaarxiv icon

Adversarial Attacks on ASR Systems: An Overview

Aug 03, 2022
Xiao Zhang, Hao Tan, Xuan Huang, Denghui Zhang, Keke Tang, Zhaoquan Gu

Figure 1 for Adversarial Attacks on ASR Systems: An Overview
Figure 2 for Adversarial Attacks on ASR Systems: An Overview
Figure 3 for Adversarial Attacks on ASR Systems: An Overview
Viaarxiv icon

Adversarial Joint Training with Self-Attention Mechanism for Robust End-to-End Speech Recognition

Apr 03, 2021
Lujun Li, Yikai Kang, Yuchen Shi, Ludwig Kürzinger, Tobias Watzel, Gerhard Rigoll

Figure 1 for Adversarial Joint Training with Self-Attention Mechanism for Robust End-to-End Speech Recognition
Figure 2 for Adversarial Joint Training with Self-Attention Mechanism for Robust End-to-End Speech Recognition
Figure 3 for Adversarial Joint Training with Self-Attention Mechanism for Robust End-to-End Speech Recognition
Figure 4 for Adversarial Joint Training with Self-Attention Mechanism for Robust End-to-End Speech Recognition
Viaarxiv icon

VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge

Add code
Bookmark button
Alert button
Feb 20, 2023
Jaesung Huh, Andrew Brown, Jee-weon Jung, Joon Son Chung, Arsha Nagrani, Daniel Garcia-Romero, Andrew Zisserman

Figure 1 for VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Figure 2 for VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Figure 3 for VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Figure 4 for VoxSRC 2022: The Fourth VoxCeleb Speaker Recognition Challenge
Viaarxiv icon