Picture for Yuhao Cheng

Yuhao Cheng

Nuanced Emotion Recognition Based on a Segment-based MLLM Framework Leveraging Qwen3-Omni for AH Detection

Add code
Mar 12, 2026
Viaarxiv icon

WildGHand: Learning Anti-Perturbation Gaussian Hand Avatars from Monocular In-the-Wild Videos

Add code
Feb 24, 2026
Viaarxiv icon

SingingBot: An Avatar-Driven System for Robotic Face Singing Performance

Add code
Jan 05, 2026
Viaarxiv icon

MoRE: 3D Visual Geometry Reconstruction Meets Mixture-of-Experts

Add code
Oct 31, 2025
Viaarxiv icon

LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation

Add code
Aug 11, 2025
Figure 1 for LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation
Figure 2 for LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation
Figure 3 for LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation
Figure 4 for LaVieID: Local Autoregressive Diffusion Transformers for Identity-Preserving Video Creation
Viaarxiv icon

SEA: Self-Evolution Agent with Step-wise Reward for Computer Use

Add code
Aug 06, 2025
Viaarxiv icon

Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation

Add code
Dec 19, 2024
Figure 1 for Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
Figure 2 for Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
Figure 3 for Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
Figure 4 for Multimodal Latent Diffusion Model for Complex Sewing Pattern Generation
Viaarxiv icon

EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation

Add code
Dec 06, 2024
Figure 1 for EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation
Figure 2 for EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation
Figure 3 for EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation
Figure 4 for EACO: Enhancing Alignment in Multimodal LLMs via Critical Observation
Viaarxiv icon

Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars

Add code
Oct 11, 2024
Figure 1 for Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
Figure 2 for Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
Figure 3 for Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
Figure 4 for Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars
Viaarxiv icon

Revealing Directions for Text-guided 3D Face Editing

Add code
Oct 07, 2024
Figure 1 for Revealing Directions for Text-guided 3D Face Editing
Figure 2 for Revealing Directions for Text-guided 3D Face Editing
Figure 3 for Revealing Directions for Text-guided 3D Face Editing
Figure 4 for Revealing Directions for Text-guided 3D Face Editing
Viaarxiv icon