Alert button

"speech recognition": models, code, and papers
Alert button

Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey

Feb 22, 2022
Ngoc Dung Huynh, Mohamed Reda Bouadjenek, Imran Razzak, Kevin Lee, Chetan Arora, Ali Hassani, Arkady Zaslavsky

Figure 1 for Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Figure 2 for Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Figure 3 for Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Figure 4 for Adversarial Attacks on Speech Recognition Systems for Mission-Critical Applications: A Survey
Viaarxiv icon

Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition

Oct 01, 2022
Jash Rathod, Nauman Dawalatabad, Shatrughan Singh, Dhananjaya Gowda

Figure 1 for Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Figure 2 for Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Figure 3 for Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Figure 4 for Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Viaarxiv icon

Robust Self-Supervised Audio-Visual Speech Recognition

Add code
Bookmark button
Alert button
Jan 05, 2022
Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed

Figure 1 for Robust Self-Supervised Audio-Visual Speech Recognition
Figure 2 for Robust Self-Supervised Audio-Visual Speech Recognition
Figure 3 for Robust Self-Supervised Audio-Visual Speech Recognition
Figure 4 for Robust Self-Supervised Audio-Visual Speech Recognition
Viaarxiv icon

Adaptive multilingual speech recognition with pretrained models

Add code
Bookmark button
Alert button
May 24, 2022
Ngoc-Quan Pham, Alex Waibel, Jan Niehues

Figure 1 for Adaptive multilingual speech recognition with pretrained models
Viaarxiv icon

Similarity and Content-based Phonetic Self Attention for Speech Recognition

Mar 28, 2022
Kyuhong Shim, Wonyong Sung

Figure 1 for Similarity and Content-based Phonetic Self Attention for Speech Recognition
Figure 2 for Similarity and Content-based Phonetic Self Attention for Speech Recognition
Figure 3 for Similarity and Content-based Phonetic Self Attention for Speech Recognition
Figure 4 for Similarity and Content-based Phonetic Self Attention for Speech Recognition
Viaarxiv icon

Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models

Oct 07, 2021
Liang-Hsuan Tseng, Yu-Kuan Fu, Heng-Jui Chang, Hung-yi Lee

Figure 1 for Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models
Figure 2 for Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models
Figure 3 for Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models
Figure 4 for Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models
Viaarxiv icon

Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator

Add code
Bookmark button
Alert button
May 30, 2023
Guangzhi Sun, Chao Zhang, Phil Woodland

Figure 1 for Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator
Figure 2 for Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator
Figure 3 for Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator
Figure 4 for Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator
Viaarxiv icon

Pseudo-Labeling for Massively Multilingual Speech Recognition

Add code
Bookmark button
Alert button
Oct 30, 2021
Loren Lugosch, Tatiana Likhomanenko, Gabriel Synnaeve, Ronan Collobert

Figure 1 for Pseudo-Labeling for Massively Multilingual Speech Recognition
Figure 2 for Pseudo-Labeling for Massively Multilingual Speech Recognition
Figure 3 for Pseudo-Labeling for Massively Multilingual Speech Recognition
Figure 4 for Pseudo-Labeling for Massively Multilingual Speech Recognition
Viaarxiv icon

Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition

Oct 26, 2022
Sharman Tan, Piyush Behre, Nick Kibre, Issac Alphonso, Shuangyu Chang

Figure 1 for Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition
Figure 2 for Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition
Figure 3 for Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition
Figure 4 for Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition
Viaarxiv icon

Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition

Sep 13, 2022
Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno

Figure 1 for Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition
Figure 2 for Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition
Figure 3 for Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition
Figure 4 for Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition
Viaarxiv icon