Voice Conversion


Voice conversion is the process of converting the voice of one speaker into the voice of another speaker.

Non-invasive electromyographic speech neuroprosthesis: a geometric perspective

Add code
Feb 09, 2025
Viaarxiv icon

Empathetic Conversational Agents: Utilizing Neural and Physiological Signals for Enhanced Empathetic Interactions

Add code
Jan 14, 2025
Viaarxiv icon

Modular Conversational Agents for Surveys and Interviews

Add code
Dec 22, 2024
Figure 1 for Modular Conversational Agents for Surveys and Interviews
Figure 2 for Modular Conversational Agents for Surveys and Interviews
Figure 3 for Modular Conversational Agents for Surveys and Interviews
Figure 4 for Modular Conversational Agents for Surveys and Interviews
Viaarxiv icon

Who Can Withstand Chat-Audio Attacks? An Evaluation Benchmark for Large Language Models

Add code
Nov 22, 2024
Viaarxiv icon

Evaluating Spoken Language as a Biomarker for Automated Screening of Cognitive Impairment

Add code
Jan 30, 2025
Viaarxiv icon

Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis

Add code
Nov 02, 2024
Figure 1 for Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Figure 2 for Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Figure 3 for Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Figure 4 for Fish-Speech: Leveraging Large Language Models for Advanced Multilingual Text-to-Speech Synthesis
Viaarxiv icon

Audiovisual angle and voice incongruence do not affect audiovisual verbal short-term memory in virtual reality

Add code
Oct 30, 2024
Figure 1 for Audiovisual angle and voice incongruence do not affect audiovisual verbal short-term memory in virtual reality
Figure 2 for Audiovisual angle and voice incongruence do not affect audiovisual verbal short-term memory in virtual reality
Figure 3 for Audiovisual angle and voice incongruence do not affect audiovisual verbal short-term memory in virtual reality
Figure 4 for Audiovisual angle and voice incongruence do not affect audiovisual verbal short-term memory in virtual reality
Viaarxiv icon

Designing AI Personalities: Enhancing Human-Agent Interaction Through Thoughtful Persona Design

Add code
Oct 30, 2024
Viaarxiv icon

Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT

Add code
Nov 05, 2024
Figure 1 for Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT
Figure 2 for Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT
Figure 3 for Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT
Figure 4 for Speaker Emotion Recognition: Leveraging Self-Supervised Models for Feature Extraction Using Wav2Vec2 and HuBERT
Viaarxiv icon

Venire: A Machine Learning-Guided Panel Review System for Community Content Moderation

Add code
Oct 30, 2024
Viaarxiv icon