Picture for Chaoren Wang

Chaoren Wang

Aliasing-Free Neural Audio Synthesis

Add code
Dec 23, 2025
Viaarxiv icon

SpeechJudge: Towards Human-Level Judgment for Speech Naturalness

Add code
Nov 11, 2025
Viaarxiv icon

SP-MCQA: Evaluating Intelligibility of TTS Beyond the Word Level

Add code
Oct 30, 2025
Viaarxiv icon

Neurodyne: Neural Pitch Manipulation with Representation Learning and Cycle-Consistency GAN

Add code
May 21, 2025
Viaarxiv icon

DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation

Add code
May 19, 2025
Viaarxiv icon

SingNet: Towards a Large-Scale, Diverse, and In-the-Wild Singing Voice Dataset

Add code
May 14, 2025
Viaarxiv icon

Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment

Add code
May 07, 2025
Viaarxiv icon

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

Add code
Jan 27, 2025
Viaarxiv icon

Overview of the Amphion Toolkit (v0.2)

Add code
Jan 26, 2025
Figure 1 for Overview of the Amphion Toolkit (v0.2)
Figure 2 for Overview of the Amphion Toolkit (v0.2)
Figure 3 for Overview of the Amphion Toolkit (v0.2)
Figure 4 for Overview of the Amphion Toolkit (v0.2)
Viaarxiv icon

Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

Add code
Jul 07, 2024
Figure 1 for Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
Figure 2 for Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
Figure 3 for Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
Figure 4 for Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation
Viaarxiv icon