Alert button

"speech recognition": models, code, and papers
Alert button

Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity

Add code
Bookmark button
Alert button
Nov 02, 2021
Peter Wu, Jiatong Shi, Yifan Zhong, Shinji Watanabe, Alan W Black

Figure 1 for Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
Figure 2 for Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
Figure 3 for Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
Figure 4 for Cross-lingual Transfer for Speech Processing using Acoustic Language Similarity
Viaarxiv icon

Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization

Feb 13, 2019
Jorge, Davila-Chacon, Jindong, Liu, Stefan, Wermter

Figure 1 for Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization
Figure 2 for Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization
Figure 3 for Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization
Figure 4 for Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization
Viaarxiv icon

Streaming parallel transducer beam search with fast-slow cascaded encoders

Mar 29, 2022
Jay Mahadeokar, Yangyang Shi, Ke Li, Duc Le, Jiedan Zhu, Vikas Chandra, Ozlem Kalinli, Michael L Seltzer

Figure 1 for Streaming parallel transducer beam search with fast-slow cascaded encoders
Figure 2 for Streaming parallel transducer beam search with fast-slow cascaded encoders
Figure 3 for Streaming parallel transducer beam search with fast-slow cascaded encoders
Figure 4 for Streaming parallel transducer beam search with fast-slow cascaded encoders
Viaarxiv icon

Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments

Add code
Bookmark button
Alert button
Feb 21, 2022
Mario Esparza

Figure 1 for Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Figure 2 for Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Figure 3 for Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Figure 4 for Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments
Viaarxiv icon

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Add code
Bookmark button
Alert button
Mar 29, 2022
Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li

Figure 1 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 2 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 3 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 4 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Viaarxiv icon

MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification

Add code
Bookmark button
Alert button
Mar 29, 2022
Yang Zhang, Zhiqiang Lv, Haibin Wu, Shanshan Zhang, Pengfei Hu, Zhiyong Wu, Hung-yi Lee, Helen Meng

Figure 1 for MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Figure 2 for MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Figure 3 for MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Figure 4 for MFA-Conformer: Multi-scale Feature Aggregation Conformer for Automatic Speaker Verification
Viaarxiv icon

Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR

Add code
Bookmark button
Alert button
Mar 29, 2022
Fangyuan Wang, Bo Xu

Figure 1 for Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR
Figure 2 for Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR
Figure 3 for Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR
Figure 4 for Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR
Viaarxiv icon

Influence Functions for Sequence Tagging Models

Add code
Bookmark button
Alert button
Oct 25, 2022
Sarthak Jain, Varun Manjunatha, Byron C. Wallace, Ani Nenkova

Figure 1 for Influence Functions for Sequence Tagging Models
Figure 2 for Influence Functions for Sequence Tagging Models
Figure 3 for Influence Functions for Sequence Tagging Models
Figure 4 for Influence Functions for Sequence Tagging Models
Viaarxiv icon

Integrating HMM-Based Speech Recognition With Direct Manipulation In A Multimodal Korean Natural Language Interface

Nov 18, 1996
Geunbae Lee, Jong-Hyeok Lee, Sangeok Kim

Figure 1 for Integrating HMM-Based Speech Recognition With Direct Manipulation In A Multimodal Korean Natural Language Interface
Viaarxiv icon

Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications

Apr 25, 2022
Han Cai, Ji Lin, Yujun Lin, Zhijian Liu, Haotian Tang, Hanrui Wang, Ligeng Zhu, Song Han

Figure 1 for Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Figure 2 for Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Figure 3 for Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Figure 4 for Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Viaarxiv icon