Si Liu

MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning

Oct 16, 2025
Via arXiv

Factuality Matters: When Image Generation and Editing Meet Structured Visuals

Oct 06, 2025
Via arXiv

FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehensive Benchmark

Sep 11, 2025
Via arXiv

AeroDuo: Aerial Duo for UAV-based Vision and Language Navigation

Aug 21, 2025
Via arXiv

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Aug 07, 2025
Via arXiv

DOMR: Establishing Cross-View Segmentation via Dense Object Matching

Aug 06, 2025
Via arXiv

CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective

Aug 01, 2025
Via arXiv

OctoNav: Towards Generalist Embodied Navigation

Jun 11, 2025
Via arXiv

RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation

Jun 07, 2025
Via arXiv

UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

May 21, 2025
Via arXiv