Picture for Qi Xu

Qi Xu

Watch, Remember, Reason: Human-View Video Understanding with MLLMs

Add code
Jun 05, 2026
Viaarxiv icon

Towards One-to-Many Temporal Grounding

Add code
Jun 04, 2026
Viaarxiv icon

Masked Generative Transformer Is What You Need for Image Editing

Add code
May 11, 2026
Viaarxiv icon

The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results

Add code
Apr 13, 2026
Viaarxiv icon

Any 3D Scene is Worth 1K Tokens: 3D-Grounded Representation for Scene Generation at Scale

Add code
Apr 13, 2026
Viaarxiv icon

NTIRE 2026 Challenge on Short-form UGC Video Restoration in the Wild with Generative Models: Datasets, Methods and Results

Add code
Apr 12, 2026
Viaarxiv icon

NTIRE 2026 Challenge on Bitstream-Corrupted Video Restoration: Methods and Results

Add code
Apr 09, 2026
Viaarxiv icon

AgentGate: A Lightweight Structured Routing Engine for the Internet of Agents

Add code
Apr 08, 2026
Viaarxiv icon

NTIRE 2026 3D Restoration and Reconstruction in Real-world Adverse Conditions: RealX3D Challenge Results

Add code
Apr 05, 2026
Viaarxiv icon

VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification

Add code
Apr 02, 2026
Viaarxiv icon