Alert button

"speech recognition": models, code, and papers
Alert button

Exploring wav2vec 2.0 on speaker verification and language identification

Add code
Bookmark button
Alert button
Jan 14, 2021
Zhiyun Fan, Meng Li, Shiyu Zhou, Bo Xu

Figure 1 for Exploring wav2vec 2.0 on speaker verification and language identification
Figure 2 for Exploring wav2vec 2.0 on speaker verification and language identification
Figure 3 for Exploring wav2vec 2.0 on speaker verification and language identification
Figure 4 for Exploring wav2vec 2.0 on speaker verification and language identification
Viaarxiv icon

Affective Burst Detection from Speech using Kernel-fusion Dilated Convolutional Neural Networks

Oct 08, 2021
Berkay Kopru, Engin Erzin

Figure 1 for Affective Burst Detection from Speech using Kernel-fusion Dilated Convolutional Neural Networks
Figure 2 for Affective Burst Detection from Speech using Kernel-fusion Dilated Convolutional Neural Networks
Figure 3 for Affective Burst Detection from Speech using Kernel-fusion Dilated Convolutional Neural Networks
Figure 4 for Affective Burst Detection from Speech using Kernel-fusion Dilated Convolutional Neural Networks
Viaarxiv icon

Learning Efficient Representations for Keyword Spotting with Triplet Loss

Add code
Bookmark button
Alert button
Jan 12, 2021
Roman Vygon, Nikolay Mikhaylovskiy

Figure 1 for Learning Efficient Representations for Keyword Spotting with Triplet Loss
Figure 2 for Learning Efficient Representations for Keyword Spotting with Triplet Loss
Figure 3 for Learning Efficient Representations for Keyword Spotting with Triplet Loss
Figure 4 for Learning Efficient Representations for Keyword Spotting with Triplet Loss
Viaarxiv icon

Bandwidth Embeddings for Mixed-bandwidth Speech Recognition

Sep 05, 2019
Gautam Mantena, Ozlem Kalinli, Ossama Abdel-Hamid, Don McAllaster

Figure 1 for Bandwidth Embeddings for Mixed-bandwidth Speech Recognition
Figure 2 for Bandwidth Embeddings for Mixed-bandwidth Speech Recognition
Figure 3 for Bandwidth Embeddings for Mixed-bandwidth Speech Recognition
Figure 4 for Bandwidth Embeddings for Mixed-bandwidth Speech Recognition
Viaarxiv icon

Fixing Errors of the Google Voice Recognizer through Phonetic Distance Metrics

Feb 18, 2021
Diego Campos-Sobrino, Mario Campos-Soberanis, Iván Martínez-Chin, Víctor Uc-Cetina

Figure 1 for Fixing Errors of the Google Voice Recognizer through Phonetic Distance Metrics
Figure 2 for Fixing Errors of the Google Voice Recognizer through Phonetic Distance Metrics
Figure 3 for Fixing Errors of the Google Voice Recognizer through Phonetic Distance Metrics
Figure 4 for Fixing Errors of the Google Voice Recognizer through Phonetic Distance Metrics
Viaarxiv icon

Impact of ASR on Alzheimer's Disease Detection: All Errors are Equal, but Deletions are More Equal than Others

Apr 08, 2019
Aparna Balagopalan, Ksenia Shkaruta, Jekaterina Novikova

Figure 1 for Impact of ASR on Alzheimer's Disease Detection: All Errors are Equal, but Deletions are More Equal than Others
Figure 2 for Impact of ASR on Alzheimer's Disease Detection: All Errors are Equal, but Deletions are More Equal than Others
Figure 3 for Impact of ASR on Alzheimer's Disease Detection: All Errors are Equal, but Deletions are More Equal than Others
Figure 4 for Impact of ASR on Alzheimer's Disease Detection: All Errors are Equal, but Deletions are More Equal than Others
Viaarxiv icon

Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation

Sep 06, 2020
Akhil Mathur, Fahim Kawsar, Nadia Berthouze, Nicholas D. Lane

Figure 1 for Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation
Figure 2 for Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation
Figure 3 for Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation
Figure 4 for Libri-Adapt: A New Speech Dataset for Unsupervised Domain Adaptation
Viaarxiv icon

PhyAug: Physics-Directed Data Augmentation for Deep Sensing Model Transfer in Cyber-Physical Systems

Apr 19, 2021
Wenjie Luo, Zhenyu Yan, Qun Song, Rui Tan

Figure 1 for PhyAug: Physics-Directed Data Augmentation for Deep Sensing Model Transfer in Cyber-Physical Systems
Figure 2 for PhyAug: Physics-Directed Data Augmentation for Deep Sensing Model Transfer in Cyber-Physical Systems
Figure 3 for PhyAug: Physics-Directed Data Augmentation for Deep Sensing Model Transfer in Cyber-Physical Systems
Figure 4 for PhyAug: Physics-Directed Data Augmentation for Deep Sensing Model Transfer in Cyber-Physical Systems
Viaarxiv icon

BRDS: An FPGA-based LSTM Accelerator with Row-Balanced Dual-Ratio Sparsification

Jan 07, 2021
Seyed Abolfazl Ghasemzadeh, Erfan Bank Tavakoli, Mehdi Kamal, Ali Afzali-Kusha, Massoud Pedram

Figure 1 for BRDS: An FPGA-based LSTM Accelerator with Row-Balanced Dual-Ratio Sparsification
Figure 2 for BRDS: An FPGA-based LSTM Accelerator with Row-Balanced Dual-Ratio Sparsification
Figure 3 for BRDS: An FPGA-based LSTM Accelerator with Row-Balanced Dual-Ratio Sparsification
Figure 4 for BRDS: An FPGA-based LSTM Accelerator with Row-Balanced Dual-Ratio Sparsification
Viaarxiv icon

Joint Masked CPC and CTC Training for ASR

Oct 30, 2020
Chaitanya Talnikar, Tatiana Likhomanenko, Ronan Collobert, Gabriel Synnaeve

Figure 1 for Joint Masked CPC and CTC Training for ASR
Figure 2 for Joint Masked CPC and CTC Training for ASR
Figure 3 for Joint Masked CPC and CTC Training for ASR
Figure 4 for Joint Masked CPC and CTC Training for ASR
Viaarxiv icon