Alert button

"speech": models, code, and papers
Alert button

Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement

Add code
Bookmark button
Alert button
Aug 17, 2023
Ye-Xin Lu, Yang Ai, Zhen-Hua Ling

Figure 1 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Figure 2 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Figure 3 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Figure 4 for Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement
Viaarxiv icon

Federated Learning with Differential Privacy for End-to-End Speech Recognition

Sep 29, 2023
Martin Pelikan, Sheikh Shams Azam, Vitaly Feldman, Jan "Honza" Silovsky, Kunal Talwar, Tatiana Likhomanenko

Viaarxiv icon

Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach

Nov 09, 2023
Cristina Palmero, Mikel deVelasco, Mohamed Amine Hmani, Aymen Mtibaa, Leila Ben Letaifa, Pau Buch-Cardona, Raquel Justo, Terry Amorese, Eduardo González-Fraile, Begoña Fernández-Ruanova, Jofre Tenorio-Laranga, Anna Torp Johansen, Micaela Rodrigues da Silva, Liva Jenny Martinussen, Maria Stylianou Korsnes, Gennaro Cordasco, Anna Esposito, Mounim A. El-Yacoubi, Dijana Petrovska-Delacrétaz, M. Inés Torres, Sergio Escalera

Figure 1 for Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
Figure 2 for Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
Figure 3 for Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
Figure 4 for Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach
Viaarxiv icon

Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts

Add code
Bookmark button
Alert button
Jul 14, 2023
Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao

Figure 1 for Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Figure 2 for Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Figure 3 for Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Figure 4 for Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts
Viaarxiv icon

Break it, Imitate it, Fix it: Robustness by Generating Human-Like Attacks

Add code
Bookmark button
Alert button
Oct 25, 2023
Aradhana Sinha, Ananth Balashankar, Ahmad Beirami, Thi Avrahami, Jilin Chen, Alex Beutel

Viaarxiv icon

mahaNLP: A Marathi Natural Language Processing Library

Add code
Bookmark button
Alert button
Nov 05, 2023
Vidula Magdum, Omkar Dhekane, Sharayu Hiwarkhedkar, Saloni Mittal, Raviraj Joshi

Figure 1 for mahaNLP: A Marathi Natural Language Processing Library
Figure 2 for mahaNLP: A Marathi Natural Language Processing Library
Figure 3 for mahaNLP: A Marathi Natural Language Processing Library
Figure 4 for mahaNLP: A Marathi Natural Language Processing Library
Viaarxiv icon

Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?

Add code
Bookmark button
Alert button
Sep 12, 2023
Xin Wang, Junichi Yamagishi

Figure 1 for Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?
Figure 2 for Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?
Figure 3 for Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?
Figure 4 for Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?
Viaarxiv icon

Optimized Tokenization for Transcribed Error Correction

Add code
Bookmark button
Alert button
Oct 16, 2023
Tomer Wullach, Shlomo E. Chazan

Figure 1 for Optimized Tokenization for Transcribed Error Correction
Figure 2 for Optimized Tokenization for Transcribed Error Correction
Figure 3 for Optimized Tokenization for Transcribed Error Correction
Figure 4 for Optimized Tokenization for Transcribed Error Correction
Viaarxiv icon

Weigh Your Own Words: Improving Hate Speech Counter Narrative Generation via Attention Regularization

Add code
Bookmark button
Alert button
Sep 05, 2023
Helena Bonaldi, Giuseppe Attanasio, Debora Nozza, Marco Guerini

Viaarxiv icon

On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis

Add code
Bookmark button
Alert button
Jul 11, 2023
Siyang Wang, Gustav Eje Henter, Joakim Gustafson, Éva Székely

Figure 1 for On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis
Figure 2 for On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis
Figure 3 for On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis
Figure 4 for On the Use of Self-Supervised Speech Representations in Spontaneous Speech Synthesis
Viaarxiv icon