Voice Conversion


Voice conversion transforms speech from a source speaker so that it sounds like a target speaker while preserving the linguistic content.

Advancing NAM-to-Speech Conversion with Novel Methods and the MultiNAM Dataset

Dec 25, 2024
Via arXiv

FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems

Feb 19, 2025
Via arXiv

A Review of Challenges in Speech-based Conversational AI for Elderly Care

Dec 10, 2024
Via arXiv

Building low-resource African language corpora: A case study of Kidawida, Kalenjin and Dholuo

Jan 19, 2025
Via arXiv

Non-invasive electromyographic speech neuroprosthesis: a geometric perspective

Feb 09, 2025
Via arXiv

Empathetic Conversational Agents: Utilizing Neural and Physiological Signals for Enhanced Empathetic Interactions

Jan 14, 2025
Via arXiv

Modular Conversational Agents for Surveys and Interviews

Dec 22, 2024
Via arXiv

Who Can Withstand Chat-Audio Attacks? An Evaluation Benchmark for Large Language Models

Nov 22, 2024
Via arXiv

Evaluating Spoken Language as a Biomarker for Automated Screening of Cognitive Impairment

Jan 30, 2025
Via arXiv

Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT

Nov 05, 2024
Via arXiv