Picture for Qifeng Chen

Qifeng Chen

AC-Foley: Reference-Audio-Guided Video-to-Audio Synthesis with Acoustic Transfer

Add code
Mar 16, 2026
Viaarxiv icon

Instruction-based Image Editing with Planning, Reasoning, and Generation

Add code
Feb 26, 2026
Viaarxiv icon

FastVMT: Eliminating Redundancy in Video Motion Transfer

Add code
Feb 05, 2026
Viaarxiv icon

HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos

Add code
Feb 02, 2026
Viaarxiv icon

Show, Don't Tell: Morphing Latent Reasoning into Image Generation

Add code
Feb 02, 2026
Viaarxiv icon

FlyAware: Inertia-Aware Aerial Manipulation via Vision-Based Estimation and Post-Grasp Adaptation

Add code
Jan 30, 2026
Viaarxiv icon

TIGaussian: Disentangle Gaussians for Spatial-Awared Text-Image-3D Alignment

Add code
Jan 27, 2026
Viaarxiv icon

Active Intelligence in Video Avatars via Closed-loop World Modeling

Add code
Dec 23, 2025
Viaarxiv icon

LongVideoAgent: Multi-Agent Reasoning with Long Videos

Add code
Dec 23, 2025
Figure 1 for LongVideoAgent: Multi-Agent Reasoning with Long Videos
Figure 2 for LongVideoAgent: Multi-Agent Reasoning with Long Videos
Figure 3 for LongVideoAgent: Multi-Agent Reasoning with Long Videos
Figure 4 for LongVideoAgent: Multi-Agent Reasoning with Long Videos
Viaarxiv icon

Learning Generalizable Hand-Object Tracking from Synthetic Demonstrations

Add code
Dec 22, 2025
Viaarxiv icon