Picture for Jiawei Jin

Jiawei Jin

VividVoice: A Unified Framework for Scene-Aware Visually-Driven Speech Synthesis

Add code
Feb 01, 2026
Viaarxiv icon

"In This Environment, As That Speaker": A Text-Driven Framework for Multi-Attribute Speech Conversion

Add code
Jun 08, 2025
Figure 1 for "In This Environment, As That Speaker": A Text-Driven Framework for Multi-Attribute Speech Conversion
Figure 2 for "In This Environment, As That Speaker": A Text-Driven Framework for Multi-Attribute Speech Conversion
Figure 3 for "In This Environment, As That Speaker": A Text-Driven Framework for Multi-Attribute Speech Conversion
Figure 4 for "In This Environment, As That Speaker": A Text-Driven Framework for Multi-Attribute Speech Conversion
Viaarxiv icon