"speech recognition": models, code, and papers

Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model

Feb 27, 2023
Jaeyoung Huh, Sangjoon Park, Jeong Eun Lee, Jong Chul Ye

TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices

Aug 23, 2020
Alexander Wong, Mahmoud Famouri, Maya Pavlova, Siddharth Surana

Speech Corpora Divergence Based Unsupervised Data Selection for ASR

Feb 26, 2023
Changfeng Gao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

4-bit Conformer with Native Quantization Aware Training for Speech Recognition

Mar 29, 2022
Shaojin Ding, Phoenix Meadowlark, Yanzhang He, Lukasz Lew, Shivani Agrawal, Oleg Rybakov

Locality Matters: A Locality-Biased Linear Attention for Automatic Speech Recognition

Mar 29, 2022
Jingyu Sun, Guiping Zhong, Dinghao Zhou, Baoxiang Li, Yiran Zhong

Dereverberation of Autoregressive Envelopes for Far-field Speech Recognition

Aug 13, 2021
Anurenjan Purushothaman, Anirudh Sreeram, Rohit Kumar, Sriram Ganapathy

Visual Information Matters for ASR Error Correction

Mar 16, 2023
Vanya Bannihatti Kumar, Shanbo Cheng, Ningxin Peng, Yuchen Zhang

An Empirical Study of Language Model Integration for Transducer based Speech Recognition

Mar 31, 2022
Huahuan Zheng, Keyu An, Zhijian Ou, Chen Huang, Ke Ding, Guanglu Wan
