Picture for Qifeng Chen

Qifeng Chen

Does Synthetic Layered Design Data Benefit Layered Design Decomposition?

Add code
May 14, 2026
Viaarxiv icon

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives

Add code
May 12, 2026
Viaarxiv icon

MedHorizon: Towards Long-context Medical Video Understanding in the Wild

Add code
May 07, 2026
Viaarxiv icon

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Add code
Apr 24, 2026
Viaarxiv icon

Divide-then-Diagnose: Weaving Clinician-Inspired Contexts for Ultra-Long Capsule Endoscopy Videos

Add code
Apr 23, 2026
Viaarxiv icon

Multi-modal Reasoning with LLMs for Visual Semantic Arithmetic

Add code
Apr 21, 2026
Viaarxiv icon

AnimationBench: Are Video Models Good at Character-Centric Animation?

Add code
Apr 16, 2026
Viaarxiv icon

Switch: Learning Agile Skills Switching for Humanoid Robots

Add code
Apr 16, 2026
Viaarxiv icon

Audio-Omni: Extending Multi-modal Understanding to Versatile Audio Generation and Editing

Add code
Apr 12, 2026
Viaarxiv icon

InsEdit: Towards Instruction-based Visual Editing via Data-Efficient Video Diffusion Models Adaptation

Add code
Apr 09, 2026
Viaarxiv icon