Voice Cloning


UniVoice: Unifying Autoregressive ASR and Flow-Matching based TTS with Large Language Models

Add code
Oct 06, 2025
Viaarxiv icon

Fed-PISA: Federated Voice Cloning via Personalized Identity-Style Adaptation

Add code
Sep 19, 2025
Viaarxiv icon

A Lightweight Pipeline for Noisy Speech Voice Cloning and Accurate Lip Sync Synthesis

Add code
Sep 16, 2025
Viaarxiv icon

MELA-TTS: Joint transformer-diffusion model with representation alignment for speech synthesis

Add code
Sep 18, 2025
Viaarxiv icon

HISPASpoof: A New Dataset For Spanish Speech Forensics

Add code
Sep 11, 2025
Viaarxiv icon

Spectral Masking and Interpolation Attack (SMIA): A Black-box Adversarial Attack against Voice Authentication and Anti-Spoofing Systems

Add code
Sep 09, 2025
Viaarxiv icon

Cloning a Conversational Voice AI Agent from Call\,Recording Datasets for Telesales

Add code
Sep 05, 2025
Viaarxiv icon

Towards Improved Speech Recognition through Optimized Synthetic Data Generation

Add code
Aug 29, 2025
Viaarxiv icon

DiFlow-TTS: Discrete Flow Matching with Factorized Speech Tokens for Low-Latency Zero-Shot Text-To-Speech

Add code
Sep 11, 2025
Viaarxiv icon

Fairness in Dysarthric Speech Synthesis: Understanding Intrinsic Bias in Dysarthric Speech Cloning using F5-TTS

Add code
Aug 07, 2025
Viaarxiv icon