Alert button

"speech recognition": models, code, and papers
Alert button

Exploiting Beam Search Confidence for Energy-Efficient Speech Recognition

Jan 22, 2021
Dennis Pinto, Jose-María Arnau, Antonio González

Figure 1 for Exploiting Beam Search Confidence for Energy-Efficient Speech Recognition
Figure 2 for Exploiting Beam Search Confidence for Energy-Efficient Speech Recognition
Figure 3 for Exploiting Beam Search Confidence for Energy-Efficient Speech Recognition
Figure 4 for Exploiting Beam Search Confidence for Energy-Efficient Speech Recognition
Viaarxiv icon

A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition

Jul 03, 2022
Ying Hu, Yuwu Tang, Hao Huang, Liang He

Figure 1 for A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition
Figure 2 for A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition
Figure 3 for A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition
Figure 4 for A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition
Viaarxiv icon

Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism

Jul 02, 2022
Kun Wei, Pengcheng Guo, Ning Jiang

Figure 1 for Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
Figure 2 for Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
Figure 3 for Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
Figure 4 for Improving Transformer-based Conversational ASR by Inter-Sentential Attention Mechanism
Viaarxiv icon

TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection

Oct 27, 2022
Piyush Behre, Sharman Tan, Amy Shah, Harini Kesavamoorthy, Shuangyu Chang, Fei Zuo, Chris Basoglu, Sayan Pathak

Figure 1 for TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection
Figure 2 for TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection
Figure 3 for TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection
Figure 4 for TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection
Viaarxiv icon

On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors

Oct 27, 2022
Zaharah Bukhsh, Aaqib Saeed

Figure 1 for On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors
Figure 2 for On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors
Figure 3 for On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors
Figure 4 for On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors
Viaarxiv icon

Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance

Add code
Bookmark button
Alert button
Oct 27, 2022
Yuanzhe Chen, Ming Tu, Tang Li, Xin Li, Qiuqiang Kong, Jiaxin Li, Zhichao Wang, Qiao Tian, Yuping Wang, Yuxuan Wang

Figure 1 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Figure 2 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Figure 3 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Figure 4 for Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance
Viaarxiv icon

Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning

Add code
Bookmark button
Alert button
Oct 27, 2022
Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

Figure 1 for Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning
Figure 2 for Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning
Figure 3 for Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning
Figure 4 for Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning
Viaarxiv icon

Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead

Oct 27, 2022
Piyush Behre, Naveen Parihar, Sharman Tan, Amy Shah, Eva Sharma, Geoffrey Liu, Shuangyu Chang, Hosam Khalil, Chris Basoglu, Sayan Pathak

Figure 1 for Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead
Figure 2 for Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead
Figure 3 for Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead
Figure 4 for Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead
Viaarxiv icon

Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment

Oct 24, 2020
Ethan A. Chi, Julian Salazar, Katrin Kirchhoff

Figure 1 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Figure 2 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Figure 3 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Figure 4 for Align-Refine: Non-Autoregressive Speech Recognition via Iterative Realignment
Viaarxiv icon

On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches

Nov 16, 2022
Guilherme Schu, Parvaneh Janbakhshi, Ina Kodrasi

Figure 1 for On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches
Figure 2 for On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches
Figure 3 for On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches
Figure 4 for On using the UA-Speech and TORGO databases to validate automatic dysarthric speech classification approaches
Viaarxiv icon