Pengfei Wan

SemanticGen: Video Generation in Semantic Space

Dec 24, 2025
Via arXiv

Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models

Dec 22, 2025
Via arXiv

Kling-Omni Technical Report

Dec 18, 2025
Via arXiv

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Dec 17, 2025
Via arXiv

MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives

Dec 16, 2025
Via arXiv

Astra: General Interactive World Model with Autoregressive Denoising

Dec 15, 2025
Via arXiv

KlingAvatar 2.0 Technical Report

Dec 15, 2025
Via arXiv

FilmWeaver: Weaving Consistent Multi-Shot Videos with Cache-Guided Autoregressive Diffusion

Dec 12, 2025
Via arXiv

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Dec 12, 2025
Via arXiv

UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation

Dec 08, 2025
Via arXiv