Picture for Chengming Xu

Chengming Xu

SPOT-E: Test-Time Entropy Shaping with Visual Spotlights for Frozen VLMs

Add code
Jun 18, 2026
Viaarxiv icon

JAVEDIT: Joint Audio-Visual Instruction-Guided Video Editing with Agentic Data Curation

Add code
Jun 02, 2026
Viaarxiv icon

What Semantics Survive the Connector? Diagnosing VLM-to-DiT Alignment in Video Editing

Add code
May 20, 2026
Viaarxiv icon

PixVerve: Advancing Native UHR Image Generation to 100MP with a Large-Scale High-Quality Dataset

Add code
May 19, 2026
Viaarxiv icon

Evolution of Optimization Methods: Algorithms, Scenarios, and Evaluations

Add code
Apr 14, 2026
Viaarxiv icon

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Add code
Apr 02, 2026
Viaarxiv icon

Dual Latent Memory for Visual Multi-agent System

Add code
Jan 31, 2026
Viaarxiv icon

FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing

Add code
Jan 06, 2026
Viaarxiv icon

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Add code
Dec 15, 2025
Viaarxiv icon

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Add code
Nov 14, 2025
Viaarxiv icon