Xiangyue Zhang

PersonaGesture: Single-Reference Co-Speech Gesture Personalization for Unseen Speakers

May 07, 2026
Via arXiv

Deep Researcher Agent: An Autonomous Framework for 24/7 Deep Learning Experimentation with Zero-Cost Monitoring

Apr 07, 2026
Via arXiv

Not All Frames Are Equal: Complexity-Aware Masked Motion Generation via Motion Spectral Descriptors

Mar 31, 2026
Via arXiv

MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation

Dec 20, 2025
Via arXiv

Mitigating Error Accumulation in Co-Speech Motion Generation via Global Rotation Diffusion and Multi-Level Constraints

Nov 13, 2025
Via arXiv

EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation

Apr 15, 2025
Via arXiv

SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis

Dec 21, 2024
Via arXiv