Shunshun Yin

SoulX-Singer: Towards High-Quality Zero-Shot Singing Voice Synthesis

Feb 08, 2026

SoulX-FlashHead: Oracle-guided Generation of Infinite Real-time Streaming Talking Heads

Feb 07, 2026

SoulX-FlashTalk: Real-Time Infinite Streaming of Audio-Driven Avatars via Self-Correcting Bidirectional Distillation

Jan 06, 2026

SoulX-LiveTalk: Real-Time Infinite Streaming of Audio-Driven Avatars via Self-Correcting Bidirectional Distillation

Dec 31, 2025

RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer

Aug 07, 2025

Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression

Jun 11, 2025