Picture for Jinlu Zhang

Jinlu Zhang

Persistent Story World Simulation with Continuous Character Customization

Add code
Mar 17, 2026
Viaarxiv icon

OpenDance: Multimodal Controllable 3D Dance Generation Using Large-scale Internet Data

Add code
Jun 09, 2025
Figure 1 for OpenDance: Multimodal Controllable 3D Dance Generation Using Large-scale Internet Data
Figure 2 for OpenDance: Multimodal Controllable 3D Dance Generation Using Large-scale Internet Data
Figure 3 for OpenDance: Multimodal Controllable 3D Dance Generation Using Large-scale Internet Data
Figure 4 for OpenDance: Multimodal Controllable 3D Dance Generation Using Large-scale Internet Data
Viaarxiv icon

InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing

Add code
May 30, 2025
Figure 1 for InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Figure 2 for InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Figure 3 for InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Figure 4 for InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing
Viaarxiv icon

EMO-X: Efficient Multi-Person Pose and Shape Estimation in One-Stage

Add code
Apr 11, 2025
Viaarxiv icon

OwlSight: A Robust Illumination Adaptation Framework for Dark Video Human Action Recognition

Add code
Mar 30, 2025
Viaarxiv icon

StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization

Add code
Dec 10, 2024
Figure 1 for StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
Figure 2 for StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
Figure 3 for StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
Figure 4 for StoryWeaver: A Unified World Model for Knowledge-Enhanced Story Character Customization
Viaarxiv icon

Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation

Add code
Jun 26, 2024
Figure 1 for Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation
Figure 2 for Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation
Figure 3 for Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation
Figure 4 for Expressive Keypoints for Skeleton-based Action Recognition via Skeleton Transformation
Viaarxiv icon

A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene

Add code
Apr 17, 2024
Figure 1 for A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Figure 2 for A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Figure 3 for A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Figure 4 for A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Viaarxiv icon

Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance

Add code
Mar 26, 2024
Viaarxiv icon

Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization

Add code
Mar 11, 2024
Figure 1 for Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
Figure 2 for Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
Figure 3 for Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
Figure 4 for Fast Text-to-3D-Aware Face Generation and Manipulation via Direct Cross-modal Mapping and Geometric Regularization
Viaarxiv icon