Alert button

"speech recognition": models, code, and papers
Alert button

Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model

Add code
Bookmark button
Alert button
Feb 22, 2021
Junwei Liao, Yu Shi, Ming Gong, Linjun Shou, Sefik Eskimez, Liyang Lu, Hong Qu, Michael Zeng

Figure 1 for Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model
Figure 2 for Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model
Figure 3 for Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model
Viaarxiv icon

Assessing ASR Model Quality on Disordered Speech using BERTScore

Add code
Bookmark button
Alert button
Sep 21, 2022
Jimmy Tobin, Qisheng Li, Subhashini Venugopalan, Katie Seaver, Richard Cave, Katrin Tomanek

Figure 1 for Assessing ASR Model Quality on Disordered Speech using BERTScore
Figure 2 for Assessing ASR Model Quality on Disordered Speech using BERTScore
Figure 3 for Assessing ASR Model Quality on Disordered Speech using BERTScore
Figure 4 for Assessing ASR Model Quality on Disordered Speech using BERTScore
Viaarxiv icon

FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition

Add code
Bookmark button
Alert button
May 09, 2021
Yichong Leng, Xu Tan, Linchen Zhu, Jin Xu, Renqian Luo, Linquan Liu, Tao Qin, Xiang-Yang Li, Ed Lin, Tie-Yan Liu

Figure 1 for FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Figure 2 for FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Figure 3 for FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Figure 4 for FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition
Viaarxiv icon

Deep Graph Random Process for Relational-Thinking-Based Speech Recognition

Jul 08, 2020
Hengguan Huang, Fuzhao Xue, Hao Wang, Ye Wang

Figure 1 for Deep Graph Random Process for Relational-Thinking-Based Speech Recognition
Figure 2 for Deep Graph Random Process for Relational-Thinking-Based Speech Recognition
Figure 3 for Deep Graph Random Process for Relational-Thinking-Based Speech Recognition
Figure 4 for Deep Graph Random Process for Relational-Thinking-Based Speech Recognition
Viaarxiv icon

Non-Autoregressive Transformer Automatic Speech Recognition

Nov 10, 2019
Nanxin Chen, Shinji Watanabe, Jesús Villalba, Najim Dehak

Figure 1 for Non-Autoregressive Transformer Automatic Speech Recognition
Figure 2 for Non-Autoregressive Transformer Automatic Speech Recognition
Figure 3 for Non-Autoregressive Transformer Automatic Speech Recognition
Figure 4 for Non-Autoregressive Transformer Automatic Speech Recognition
Viaarxiv icon

Voice Conversion Can Improve ASR in Very Low-Resource Settings

Add code
Bookmark button
Alert button
Nov 04, 2021
Matthew Baas, Herman Kamper

Figure 1 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Figure 2 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Figure 3 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Figure 4 for Voice Conversion Can Improve ASR in Very Low-Resource Settings
Viaarxiv icon

Compact Graph Architecture for Speech Emotion Recognition

Add code
Bookmark button
Alert button
Aug 06, 2020
A. Shirian, T. Guha

Figure 1 for Compact Graph Architecture for Speech Emotion Recognition
Figure 2 for Compact Graph Architecture for Speech Emotion Recognition
Figure 3 for Compact Graph Architecture for Speech Emotion Recognition
Figure 4 for Compact Graph Architecture for Speech Emotion Recognition
Viaarxiv icon

Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments

Aug 27, 2022
Muskan Garg, Naveen Aggarwal

Figure 1 for Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments
Figure 2 for Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments
Figure 3 for Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments
Figure 4 for Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments
Viaarxiv icon

Generalizing in the Real World with Representation Learning

Add code
Bookmark button
Alert button
Oct 18, 2022
Tegan Maharaj

Figure 1 for Generalizing in the Real World with Representation Learning
Figure 2 for Generalizing in the Real World with Representation Learning
Figure 3 for Generalizing in the Real World with Representation Learning
Figure 4 for Generalizing in the Real World with Representation Learning
Viaarxiv icon