Alert button

"speech recognition": models, code, and papers
Alert button

Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech

Jun 09, 2023
Shijun Wang, Jón Guðnason, Damian Borth

Figure 1 for Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech
Figure 2 for Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech
Figure 3 for Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech
Figure 4 for Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech
Viaarxiv icon

Accented Speech Recognition under the Indian context

Sep 11, 2022
Ankit Grover

Figure 1 for Accented Speech Recognition under the Indian context
Figure 2 for Accented Speech Recognition under the Indian context
Figure 3 for Accented Speech Recognition under the Indian context
Figure 4 for Accented Speech Recognition under the Indian context
Viaarxiv icon

ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition

Add code
Bookmark button
Alert button
May 25, 2023
Yuanchao Li, Zeyu Zhao, Ondrej Klejch, Peter Bell, Catherine Lai

Figure 1 for ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition
Figure 2 for ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition
Figure 3 for ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition
Figure 4 for ASR and Emotional Speech: A Word-Level Investigation of the Mutual Impact of Speech and Emotion Recognition
Viaarxiv icon

Continual Learning for On-Device Speech Recognition using Disentangled Conformers

Dec 13, 2022
Anuj Diwan, Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Eunsol Choi, David Harwath, Abdelrahman Mohamed

Figure 1 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Figure 2 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Figure 3 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Figure 4 for Continual Learning for On-Device Speech Recognition using Disentangled Conformers
Viaarxiv icon

Stabilising and accelerating light gated recurrent units for automatic speech recognition

Add code
Bookmark button
Alert button
Feb 16, 2023
Adel Moumen, Titouan Parcollet

Figure 1 for Stabilising and accelerating light gated recurrent units for automatic speech recognition
Figure 2 for Stabilising and accelerating light gated recurrent units for automatic speech recognition
Viaarxiv icon

Neural approaches to spoken content embedding

Aug 28, 2023
Shane Settle

Figure 1 for Neural approaches to spoken content embedding
Figure 2 for Neural approaches to spoken content embedding
Figure 3 for Neural approaches to spoken content embedding
Figure 4 for Neural approaches to spoken content embedding
Viaarxiv icon

Privacy-oriented manipulation of speaker representations

Oct 10, 2023
Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso

Viaarxiv icon

deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition

Add code
Bookmark button
Alert button
Feb 28, 2023
Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma

Figure 1 for deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition
Figure 2 for deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition
Figure 3 for deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition
Figure 4 for deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition
Viaarxiv icon

Boosting Punctuation Restoration with Data Generation and Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 24, 2023
Viet Dac Lai, Abel Salinas, Hao Tan, Trung Bui, Quan Tran, Seunghyun Yoon, Hanieh Deilamsalehy, Franck Dernoncourt, Thien Huu Nguyen

Viaarxiv icon

Contextual-Utterance Training for Automatic Speech Recognition

Oct 27, 2022
Alejandro Gomez-Alanis, Lukas Drude, Andreas Schwarz, Rupak Vignesh Swaminathan, Simon Wiesler

Figure 1 for Contextual-Utterance Training for Automatic Speech Recognition
Figure 2 for Contextual-Utterance Training for Automatic Speech Recognition
Figure 3 for Contextual-Utterance Training for Automatic Speech Recognition
Figure 4 for Contextual-Utterance Training for Automatic Speech Recognition
Viaarxiv icon