Picture for Qifeng Chen

Qifeng Chen

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data

Add code
Sep 18, 2025
Figure 1 for ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
Figure 2 for ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
Figure 3 for ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
Figure 4 for ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
Viaarxiv icon

Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control

Add code
Aug 12, 2025
Figure 1 for Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control
Figure 2 for Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control
Figure 3 for Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control
Figure 4 for Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control
Viaarxiv icon

Follow-Your-Instruction: A Comprehensive MLLM Agent for World Data Synthesis

Add code
Aug 07, 2025
Viaarxiv icon

SpA2V: Harnessing Spatial Auditory Cues for Audio-driven Spatially-aware Video Generation

Add code
Aug 01, 2025
Viaarxiv icon

LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization

Add code
Jun 11, 2025
Figure 1 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Figure 2 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Figure 3 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Figure 4 for LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization
Viaarxiv icon

Follow-Your-Motion: Video Motion Transfer via Efficient Spatial-Temporal Decoupled Finetuning

Add code
Jun 05, 2025
Viaarxiv icon

FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers

Add code
Jun 05, 2025
Viaarxiv icon

Follow-Your-Creation: Empowering 4D Creation through Video Inpainting

Add code
Jun 05, 2025
Figure 1 for Follow-Your-Creation: Empowering 4D Creation through Video Inpainting
Figure 2 for Follow-Your-Creation: Empowering 4D Creation through Video Inpainting
Figure 3 for Follow-Your-Creation: Empowering 4D Creation through Video Inpainting
Figure 4 for Follow-Your-Creation: Empowering 4D Creation through Video Inpainting
Viaarxiv icon

UNIC: Unified In-Context Video Editing

Add code
Jun 04, 2025
Figure 1 for UNIC: Unified In-Context Video Editing
Figure 2 for UNIC: Unified In-Context Video Editing
Viaarxiv icon

Master Rules from Chaos: Learning to Reason, Plan, and Interact from Chaos for Tangram Assembly

Add code
May 17, 2025
Viaarxiv icon