Picture for Jinting Wang

Jinting Wang

UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation

Add code
Jun 04, 2025
Viaarxiv icon

AudioGenie: A Training-Free Multi-Agent Framework for Diverse Multimodality-to-Multiaudio Generation

Add code
May 28, 2025
Viaarxiv icon

A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights

Add code
Jul 11, 2024
Figure 1 for A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
Figure 2 for A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
Figure 3 for A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
Figure 4 for A Comprehensive Survey on Human Video Generation: Challenges, Methods, and Insights
Viaarxiv icon

Realistic Speech-to-Face Generation with Speech-Conditioned Latent Diffusion Model with Face Prior

Add code
Oct 05, 2023
Viaarxiv icon

A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation

Add code
Aug 17, 2023
Figure 1 for A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Figure 2 for A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Figure 3 for A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Figure 4 for A Survey on Deep Multi-modal Learning for Body Language Recognition and Generation
Viaarxiv icon