"speech": models, code, and papers

Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models

Jan 23, 2024
Chenyang Gao, Brecht Desplanques, Chelsea J.-T. Ju, Aman Chadha, Andreas Stolcke

Via arXiv

The GUA-Speech System Description for CNVSRC Challenge 2023

Dec 12, 2023
Shengqiang Li, Chao Lei, Baozhong Ma, Binbin Zhang, Fuping Pan

Via arXiv

StreamVC: Real-Time Low-Latency Voice Conversion

Jan 05, 2024
Yang Yang, Yury Kartynnik, Yunpeng Li, Jiuqiang Tang, Xing Li, George Sung, Matthias Grundmann

Via arXiv

Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers

Jan 26, 2024
Kerlos Atia Abdalmalak, Ascensión Gallardo-Antolín

Via arXiv

Evaluating Self-supervised Speech Models on a Taiwanese Hokkien Corpus

Dec 06, 2023
Yi-Hui Chou, Kalvin Chang, Meng-Ju Wu, Winston Ou, Alice Wen-Hsin Bi, Carol Yang, Bryan Y. Chen, Rong-Wei Pai, Po-Yen Yeh, Jo-Peng Chiang, Iu-Tshian Phoann, Winnie Chang, Chenxuan Cui, Noel Chen, Jiatong Shi

Via arXiv

Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification

Dec 12, 2023
Mohammed Maqsood Shaik, Dietrich Klakow, Badr M. Abdullah

Via arXiv

Evaluating and Personalizing User-Perceived Quality of Text-to-Speech Voices for Delivering Mindfulness Meditation with Different Physical Embodiments

Jan 07, 2024
Zhonghao Shi, Han Chen, Anna-Maria Velentza, Siqi Liu, Nathaniel Dennler, Allison O'Connell, Maja Matarić

Via arXiv

Keyword spotting -- Detecting commands in speech using deep learning

Dec 09, 2023
Sumedha Rai, Tong Li, Bella Lyu

Via arXiv

Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor

Jan 23, 2024
Younglo Lee, Shukjae Choi, Byeong-Yeol Kim, Zhong-Qiu Wang, Shinji Watanabe

Via arXiv

Leveraged Mel spectrograms using Harmonic and Percussive Components in Speech Emotion Recognition

Dec 18, 2023
David Hason Rudd, Huan Huo, Guandong Xu

Via arXiv