Alert button

"speech recognition": models, code, and papers
Alert button

MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition

Oct 29, 2023
Muhammad Umar Farooq, Rehan Ahmad, Thomas Hain

Figure 1 for MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition
Figure 2 for MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition
Figure 3 for MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition
Figure 4 for MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition
Viaarxiv icon

Leveraged Mel spectrograms using Harmonic and Percussive Components in Speech Emotion Recognition

Dec 18, 2023
David Hason Rudd, Huan Huo, Guandong Xu

Viaarxiv icon

Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks

Add code
Bookmark button
Alert button
Sep 18, 2023
Soumi Maiti, Yifan Peng, Shukjae Choi, Jee-weon Jung, Xuankai Chang, Shinji Watanabe

Figure 1 for Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks
Figure 2 for Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks
Figure 3 for Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks
Figure 4 for Voxtlm: unified decoder-only models for consolidating speech recognition/synthesis and speech/text continuation tasks
Viaarxiv icon

On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition

Nov 14, 2023
Xiaohan Shi, Jiajun He, Xingfeng Li, Tomoki Toda

Viaarxiv icon

VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System

Oct 27, 2023
Abdul Waheed, Bashar Talafha, Peter Sullivan, AbdelRahim Elmadany, Muhammad Abdul-Mageed

Figure 1 for VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System
Figure 2 for VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System
Figure 3 for VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System
Figure 4 for VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System
Viaarxiv icon

TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASR

Jan 06, 2024
Nagarathna Ravi, Thishyan Raj T, Vipul Arora

Viaarxiv icon

The GUA-Speech System Description for CNVSRC Challenge 2023

Dec 12, 2023
Shengqiang Li, Chao Lei, Baozhong Ma, Binbin Zhang, Fuping Pan

Viaarxiv icon

Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition

Sep 21, 2023
Yang Li, Liangzhen Lai, Yuan Shangguan, Forrest N. Iandola, Ernie Chang, Yangyang Shi, Vikas Chandra

Figure 1 for Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition
Figure 2 for Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition
Figure 3 for Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition
Figure 4 for Folding Attention: Memory and Power Optimization for On-Device Transformer-based Streaming Speech Recognition
Viaarxiv icon

Correction Focused Language Model Training for Speech Recognition

Oct 17, 2023
Yingyi Ma, Zhe Liu, Ozlem Kalinli

Viaarxiv icon

Ms-senet: Enhancing Speech Emotion Recognition Through Multi-scale Feature Fusion With Squeeze-and-excitation Blocks

Dec 19, 2023
Mengbo Li, Yulun Wu, Dichucheng Li, Yuanzhong Zheng, Yaoxuan Wang, Haojun Fei

Viaarxiv icon