Alert button

"speech": models, code, and papers
Alert button

Emotional Speech Synthesis for Companion Robot to Imitate Professional Caregiver Speech

Sep 27, 2021
Takeshi Homma, Qinghua Sun, Takuya Fujioka, Ryuta Takawaki, Eriko Ankyu, Kenji Nagamatsu, Daichi Sugawara, Etsuko T. Harada

Figure 1 for Emotional Speech Synthesis for Companion Robot to Imitate Professional Caregiver Speech
Figure 2 for Emotional Speech Synthesis for Companion Robot to Imitate Professional Caregiver Speech
Figure 3 for Emotional Speech Synthesis for Companion Robot to Imitate Professional Caregiver Speech
Figure 4 for Emotional Speech Synthesis for Companion Robot to Imitate Professional Caregiver Speech
Viaarxiv icon

A single speaker is almost all you need for automatic speech recognition

Add code
Bookmark button
Alert button
Mar 29, 2022
Edresson Casanova, Christopher Shulby, Alexander Korolev, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Aluísio, Moacir Antonelli Ponti

Figure 1 for A single speaker is almost all you need for automatic speech recognition
Figure 2 for A single speaker is almost all you need for automatic speech recognition
Figure 3 for A single speaker is almost all you need for automatic speech recognition
Viaarxiv icon

Revisiting Speech Content Privacy

Add code
Bookmark button
Alert button
Oct 13, 2021
Jennifer Williams, Junichi Yamagishi, Paul-Gauthier Noe, Cassia Valentini Botinhao, Jean-Francois Bonastre

Figure 1 for Revisiting Speech Content Privacy
Figure 2 for Revisiting Speech Content Privacy
Figure 3 for Revisiting Speech Content Privacy
Viaarxiv icon

Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech

Add code
Bookmark button
Alert button
Sep 07, 2022
Huu-Tien Dang, Thi-Hai-Yen Vuong, Xuan-Hieu Phan

Figure 1 for Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech
Figure 2 for Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech
Figure 3 for Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech
Figure 4 for Non-Standard Vietnamese Word Detection and Normalization for Text-to-Speech
Viaarxiv icon

Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic Data

Feb 13, 2023
Gorka Abad, Oguzhan Ersoy, Stjepan Picek, Aitor Urbieta

Figure 1 for Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic Data
Figure 2 for Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic Data
Figure 3 for Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic Data
Figure 4 for Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic Data
Viaarxiv icon

Dysfluencies Seldom Come Alone -- Detection as a Multi-Label Problem

Oct 28, 2022
Sebastian P. Bayerl, Dominik Wagner, Florian Hönig, Tobias Bocklet, Elmar Nöth, Korbinian Riedhammer

Figure 1 for Dysfluencies Seldom Come Alone -- Detection as a Multi-Label Problem
Figure 2 for Dysfluencies Seldom Come Alone -- Detection as a Multi-Label Problem
Figure 3 for Dysfluencies Seldom Come Alone -- Detection as a Multi-Label Problem
Viaarxiv icon

An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks

Add code
Bookmark button
Alert button
Mar 31, 2022
Kai-Wei Chang, Wei-Cheng Tseng, Shang-Wen Li, Hung-yi Lee

Figure 1 for An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Figure 2 for An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Figure 3 for An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Figure 4 for An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks
Viaarxiv icon

Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features

Add code
Bookmark button
Alert button
Nov 09, 2022
Ziqian Ning, Qicong Xie, Pengcheng Zhu, Zhichao Wang, Liumeng Xue, Jixun Yao, Lei Xie, Mengxiao Bi

Figure 1 for Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
Figure 2 for Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
Figure 3 for Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
Figure 4 for Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
Viaarxiv icon

Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition

Feb 21, 2022
Mengzhe Geng, Xurong Xie, Zi Ye, Tianzi Wang, Guinan Li, Shujie Hu, Xunying Liu, Helen Meng

Figure 1 for Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition
Figure 2 for Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition
Figure 3 for Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition
Figure 4 for Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition
Viaarxiv icon

Towards Error-Resilient Neural Speech Coding

Jul 03, 2022
Huaying Xue, Xiulian Peng, Xue Jiang, Yan Lu

Figure 1 for Towards Error-Resilient Neural Speech Coding
Figure 2 for Towards Error-Resilient Neural Speech Coding
Figure 3 for Towards Error-Resilient Neural Speech Coding
Figure 4 for Towards Error-Resilient Neural Speech Coding
Viaarxiv icon