Picture for Xiu Li

Xiu Li

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Add code
Mar 26, 2026
Viaarxiv icon

Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models

Add code
Mar 26, 2026
Viaarxiv icon

TopoMesh: High-Fidelity Mesh Autoencoding via Topological Unification

Add code
Mar 25, 2026
Viaarxiv icon

Identity-Consistent Video Generation under Large Facial-Angle Variations

Add code
Mar 22, 2026
Viaarxiv icon

RoboStereo: Dual-Tower 4D Embodied World Models for Unified Policy Optimization

Add code
Mar 13, 2026
Viaarxiv icon

MoKus: Leveraging Cross-Modal Knowledge Transfer for Knowledge-Aware Concept Customization

Add code
Mar 13, 2026
Viaarxiv icon

InfiniteDance: Scalable 3D Dance Generation Towards in-the-wild Generalization

Add code
Mar 10, 2026
Viaarxiv icon

PreciseCache: Precise Feature Caching for Efficient and High-fidelity Video Generation

Add code
Mar 03, 2026
Viaarxiv icon

Elastic Diffusion Transformer

Add code
Feb 15, 2026
Viaarxiv icon

Temporal Difference Learning with Constrained Initial Representations

Add code
Feb 12, 2026
Viaarxiv icon