"speech recognition": models, code, and papers

Adversarial Example Devastation and Detection on Speech Recognition System by Adding Random Noise

Aug 31, 2021
Mingyu Dong, Diqun Yan, Yongkang Gong, Rangding Wang

Via arXiv

Residual Energy-Based Models for End-to-End Speech Recognition

Mar 25, 2021
Qiujia Li, Yu Zhang, Bo Li, Liangliang Cao, Philip C. Woodland

Via arXiv

LiteLSTM Architecture Based on Weights Sharing for Recurrent Neural Networks

Jan 12, 2023
Nelly Elsayed, Zag ElSayed, Anthony S. Maida

Via arXiv

U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition

Jul 07, 2021
Di Wu, Binbin Zhang, Chao Yang, Zhendong Peng, Wenjing Xia, Xiaoyu Chen, Xin Lei

Via arXiv

On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches

Nov 16, 2022
Guilherme Schu, Parvaneh Janbakhshi, Ina Kodrasi

Via arXiv

On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors

Oct 27, 2022
Zaharah Bukhsh, Aaqib Saeed

Via arXiv

Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead

Oct 27, 2022
Piyush Behre, Naveen Parihar, Sharman Tan, Amy Shah, Eva Sharma, Geoffrey Liu, Shuangyu Chang, Hosam Khalil, Chris Basoglu, Sayan Pathak

Via arXiv

Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning

Oct 27, 2022
Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

Via arXiv

TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection

Oct 27, 2022
Piyush Behre, Sharman Tan, Amy Shah, Harini Kesavamoorthy, Shuangyu Chang, Fei Zuo, Chris Basoglu, Sayan Pathak

Via arXiv

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance

Oct 27, 2022
Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang

Via arXiv