Qifeng Chen

The World is Your Canvas: Painting Promptable Events with Reference Images, Trajectories, and Text

Dec 18, 2025

Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation

Dec 12, 2025

Zero-shot Synthetic Video Realism Enhancement via Structure-aware Denoising

Nov 18, 2025

LiDAR-GS++: Improving LiDAR Gaussian Reconstruction via Diffusion Priors

Nov 15, 2025

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Sep 18, 2025

Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control

Aug 12, 2025

Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis

Aug 07, 2025

SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation

Aug 01, 2025

LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization

Jun 11, 2025

FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers

Jun 05, 2025