Picture for Xiaojuan Qi

Xiaojuan Qi

ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning

Add code
May 21, 2026
Viaarxiv icon

LongLive-2.0: An NVFP4 Parallel Infrastructure for Long Video Generation

Add code
May 19, 2026
Viaarxiv icon

Vision Foundation Models as Generalist Tokenizers for Image Generation

Add code
May 18, 2026
Viaarxiv icon

CoRe-Gen: Robust Spectrum-to-Structure Generation under Imperfect Fingerprint Conditions

Add code
May 13, 2026
Viaarxiv icon

PhysEditBench: A Protocol-Conditioned Benchmark for Dense Physical-Map Prediction with Image Editors

Add code
May 13, 2026
Viaarxiv icon

When to Trust Imagination: Adaptive Action Execution for World Action Models

Add code
May 07, 2026
Viaarxiv icon

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Add code
Apr 24, 2026
Viaarxiv icon

AniGen: Unified $S^3$ Fields for Animatable 3D Asset Generation

Add code
Apr 14, 2026
Viaarxiv icon

OpenSpatial: A Principled Data Engine for Empowering Spatial Intelligence

Add code
Apr 09, 2026
Viaarxiv icon

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Add code
Apr 06, 2026
Viaarxiv icon