Xuehai Bai

MCIE: Multimodal LLM-Driven Complex Instruction Image Editing with Spatial Guidance

Feb 08, 2026
Via arXiv

How Well Do Models Follow Visual Instructions? VIBE: A Systematic Benchmark for Visual Instruction-Driven Image Editing

Feb 02, 2026
Via arXiv

FPSAttention: Training-Aware FP8 and Sparsity Co-Design for Fast Video Diffusion

Jun 06, 2025
Via arXiv