Picture for Chongyang Ma

Chongyang Ma

Envision: Embodied Visual Planning via Goal-Imagery Video Diffusion

Add code
Dec 27, 2025
Viaarxiv icon

VIVA: VLM-Guided Instruction-Based Video Editing with Reward Optimization

Add code
Dec 18, 2025
Viaarxiv icon

MAGREF: Masked Guidance for Any-Reference Video Generation

Add code
May 29, 2025
Viaarxiv icon

ATI: Any Trajectory Instruction for Controllable Video Generation

Add code
May 28, 2025
Viaarxiv icon

CINEMA: Coherent Multi-Subject Video Generation via MLLM-Based Guidance

Add code
Mar 13, 2025
Viaarxiv icon

Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting

Add code
Jan 26, 2025
Figure 1 for Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
Figure 2 for Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
Figure 3 for Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
Figure 4 for Bringing Characters to New Stories: Training-Free Theme-Specific Image Generation via Dynamic Visual Prompting
Viaarxiv icon

DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning

Add code
Oct 28, 2024
Figure 1 for DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Figure 2 for DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Figure 3 for DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Figure 4 for DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning
Viaarxiv icon

Towards Unified 3D Hair Reconstruction from Single-View Portraits

Add code
Sep 25, 2024
Figure 1 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Figure 2 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Figure 3 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Figure 4 for Towards Unified 3D Hair Reconstruction from Single-View Portraits
Viaarxiv icon

ViMo: Generating Motions from Casual Videos

Add code
Aug 13, 2024
Figure 1 for ViMo: Generating Motions from Casual Videos
Figure 2 for ViMo: Generating Motions from Casual Videos
Figure 3 for ViMo: Generating Motions from Casual Videos
Figure 4 for ViMo: Generating Motions from Casual Videos
Viaarxiv icon

LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model

Add code
May 06, 2024
Figure 1 for LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model
Figure 2 for LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model
Figure 3 for LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model
Figure 4 for LGTM: Local-to-Global Text-Driven Human Motion Diffusion Model
Viaarxiv icon