Picture for Xiu Li

Xiu Li

KVPO: ODE-Native GRPO for Autoregressive Video Alignment via KV Semantic Exploration

Add code
May 14, 2026
Viaarxiv icon

RoTE: Coarse-to-Fine Multi-Level Rotary Time Embedding for Sequential Recommendation

Add code
Apr 15, 2026
Viaarxiv icon

PRISM: Rethinking Scattered Atmosphere Reconstruction as a Unified Understanding and Generation Model for Real-world Dehazing

Add code
Apr 08, 2026
Viaarxiv icon

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Add code
Apr 06, 2026
Viaarxiv icon

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Add code
Mar 26, 2026
Viaarxiv icon

Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models

Add code
Mar 26, 2026
Viaarxiv icon

TopoMesh: High-Fidelity Mesh Autoencoding via Topological Unification

Add code
Mar 25, 2026
Viaarxiv icon

Identity-Consistent Video Generation under Large Facial-Angle Variations

Add code
Mar 22, 2026
Viaarxiv icon

RoboStereo: Dual-Tower 4D Embodied World Models for Unified Policy Optimization

Add code
Mar 13, 2026
Viaarxiv icon

MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization

Add code
Mar 13, 2026
Viaarxiv icon