Mingyuan Zhang

NECromancer: Breathing Life into Skeletons via BVH Animation

Feb 06, 2026

DiMo: Discrete Diffusion Modeling for Motion Generation and Understanding

Feb 04, 2026

Rethinking Fine-Tuning: Unlocking Hidden Capabilities in Vision-Language Models

Dec 28, 2025

MoCapAnything: Unified 3D Motion Capture for Arbitrary Skeletons from Monocular Videos

Dec 11, 2025

SWiT-4D: Sliding-Window Transformer for Lossless and Parameter-Free Temporal 4D Generation

Dec 11, 2025

Landmark Guided Visual Feature Extractor for Visual Speech Recognition with Limited Resource

Aug 10, 2025

Semantics-Aware Human Motion Generation from Audio Instructions

May 29, 2025

Boosting Large Language Models with Mask Fine-Tuning

Mar 27, 2025

SMPLest-X: Ultimate Scaling for Expressive Human Pose and Shape Estimation

Jan 16, 2025

RMD: A Simple Baseline for More General Human Motion Generation via Training-free Retrieval-Augmented Motion Diffuse

Dec 05, 2024