Pengfei Wan

SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs

Aug 21, 2024
Via arXiv

ViMo: Generating Motions from Casual Videos

Aug 13, 2024
Via arXiv

PlacidDreamer: Advancing Harmony in Text-to-3D Generation

Jul 19, 2024
Via arXiv

4Dynamic: Text-to-4D Generation with Hybrid Priors

Jul 17, 2024
Via arXiv

LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control

Jul 03, 2024
Via arXiv

VideoTetris: Towards Compositional Text-to-Video Generation

Jun 06, 2024
Via arXiv

SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance

May 24, 2024
Via arXiv

UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark

Apr 15, 2024
Via arXiv

Motion Inversion for Video Customization

Mar 29, 2024
Via arXiv

VRMM: A Volumetric Relightable Morphable Head Model

Feb 06, 2024
Via arXiv