Reo Shimizu

Investigating Human-Model Discrepancies in Speech Quality Assessment via Acoustic and Prosodic Perturbations

Jun 18, 2026
Via arXiv

PASQA: Pitch-Accent-Focused Speech Quality Assessment Model Trained on Synthetic Speech with Accent Errors

Jun 18, 2026
Via arXiv

Schrödinger Bridge Type Diffusion Models as an Extension of Variational Autoencoders

Dec 24, 2024
Via arXiv

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions

Sep 15, 2023
Via arXiv