Picture for Pengfei Wan

Pengfei Wan

AnchorWorld: Embodied Egocentric World Simulation with View-based Evolution Customization

Add code
Jun 05, 2026
Viaarxiv icon

Edit-R2: Context-Aware Reinforcement Learning for Multi-Turn Image Editing

Add code
Jun 04, 2026
Viaarxiv icon

Diffusing in the Right Space: A Systematic Study of Latent Diffusability

Add code
Jun 02, 2026
Viaarxiv icon

VLMs are Good Teachers for Video Reasoning via Adaptive Test-Time Optimization

Add code
Jun 01, 2026
Viaarxiv icon

Geometry-Aware Implicit Memory for Video World Models

Add code
Jun 01, 2026
Viaarxiv icon

SegTune: Structured and Fine-Grained Control for Song Generation

Add code
May 31, 2026
Viaarxiv icon

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

Add code
May 29, 2026
Viaarxiv icon

LongAV-Compass: Towards Unified Evaluation of Minute-Scale Audio-Visual Generation Across T2AV, I2AV, and V2AV

Add code
May 25, 2026
Viaarxiv icon

LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning

Add code
May 21, 2026
Viaarxiv icon

SRC-Flow: Compact Semantic Representations Enable Normalizing Flows for Image Generation

Add code
May 18, 2026
Viaarxiv icon