Alert button

"speech recognition": models, code, and papers
Alert button

GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition

Jun 16, 2023
Yu Pan, Yanni Hu, Yuguang Yang, Jixun Yao, Wen Fei, Lei Ma, Heng Lu

Figure 1 for GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition
Figure 2 for GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition
Figure 3 for GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition
Viaarxiv icon

deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition

Feb 28, 2023
Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma

Figure 1 for deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition
Figure 2 for deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition
Figure 3 for deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition
Figure 4 for deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition
Viaarxiv icon

Stabilising and accelerating light gated recurrent units for automatic speech recognition

Feb 16, 2023
Adel Moumen, Titouan Parcollet

Figure 1 for Stabilising and accelerating light gated recurrent units for automatic speech recognition
Figure 2 for Stabilising and accelerating light gated recurrent units for automatic speech recognition
Viaarxiv icon

Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks

Aug 18, 2023
Shu Wang, Kun Sun, Qi Li

Figure 1 for Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks
Figure 2 for Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks
Figure 3 for Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks
Figure 4 for Compensating Removed Frequency Components: Thwarting Voice Spectrum Reduction Attacks
Viaarxiv icon

Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages

Jun 13, 2023
Claytone Sikasote, Kalinda Siaminwe, Stanly Mwape, Bangiwe Zulu, Mofya Phiri, Martin Phiri, David Zulu, Mayumbo Nyirenda, Antonios Anastasopoulos

Figure 1 for Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages
Figure 2 for Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages
Figure 3 for Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages
Figure 4 for Zambezi Voice: A Multilingual Speech Corpus for Zambian Languages
Viaarxiv icon

Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective

Nov 05, 2022
Hannaneh B. Pasandi, Haniyeh B. Pasandi

Figure 1 for Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective
Figure 2 for Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective
Figure 3 for Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective
Figure 4 for Evaluation of Automated Speech Recognition Systems for Conversational Speech: A Linguistic Perspective
Viaarxiv icon

A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization

Jul 24, 2023
Edward Fish, Umberto Michieli, Mete Ozay

Viaarxiv icon

Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information

Jul 21, 2023
Dejan Porjazovski, Tamás Grósz, Mikko Kurimo

Viaarxiv icon

Continual Learning for On-Device Speech Recognition using Disentangled Conformers

Dec 13, 2022
Anuj Diwan, Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Eunsol Choi, David Harwath, Abdelrahman Mohamed

Figure 1 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Figure 2 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Figure 3 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Figure 4 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Viaarxiv icon