"speech recognition": models, code, and papers

Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation

Dec 27, 2022
Tomer Wullach, Shlomo E. Chazan

Via arXiv

Continual-wav2vec2: an Application of Continual Learning for Self-Supervised Automatic Speech Recognition

Jul 26, 2021
Samuel Kessler, Bethan Thomas, Salah Karout

Via arXiv

BrainBERT: Self-supervised representation learning for intracranial recordings

Feb 28, 2023
Christopher Wang, Vighnesh Subramaniam, Adam Uri Yaari, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu

Via arXiv

Practical Speech Recognition with HTK

Aug 06, 2019
Zulkarnaen Hatala

Via arXiv

SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks

Dec 20, 2022
Suwon Shon, Siddhant Arora, Chyi-Jiunn Lin, Ankita Pasad, Felix Wu, Roshan Sharma, Wei-Lun Wu, Hung-Yi Lee, Karen Livescu, Shinji Watanabe

Via arXiv

CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning

Mar 22, 2023
Yiting Cheng, Fangyun Wei, Jianmin Bao, Dong Chen, Wenqiang Zhang

Via arXiv

ASRPU: A Programmable Accelerator for Low-Power Automatic Speech Recognition

Feb 10, 2022
Dennis Pinto, Jose-María Arnau, Antonio González

Via arXiv

DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition

Aug 01, 2022
Z. Guo, C. Chen, E. S. Chng

Via arXiv

Robust Speech Recognition via Large-Scale Weak Supervision

Dec 06, 2022
Alec Radford, Jong Wook Kim, Tao Xu, Greg Brockman, Christine McLeavey, Ilya Sutskever

Via arXiv

Audio-Visual Speech Recognition is Worth 32×32×8 Voxels

Sep 20, 2021
Dmitriy Serdyuk, Otavio Braga, Olivier Siohan

Via arXiv