Picture for Si Liu

Si Liu

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Add code
Sep 11, 2025
Viaarxiv icon

AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation

Add code
Aug 21, 2025
Viaarxiv icon

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Add code
Aug 07, 2025
Viaarxiv icon

DOMR: Establishing Cross-View Segmentation via Dense Object Matching

Add code
Aug 06, 2025
Viaarxiv icon

CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective

Add code
Aug 01, 2025
Viaarxiv icon

OctoNav: Towards Generalist Embodied Navigation

Add code
Jun 11, 2025
Viaarxiv icon

RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation

Add code
Jun 07, 2025
Viaarxiv icon

UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

Add code
May 21, 2025
Viaarxiv icon

ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images

Add code
May 10, 2025
Viaarxiv icon

EvMic: Event-based Non-contact sound recovery from effective spatial-temporal modeling

Add code
Apr 03, 2025
Viaarxiv icon