Picture for Si Liu

Si Liu

Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation

Add code
Aug 07, 2025
Viaarxiv icon

DOMR: Establishing Cross-View Segmentation via Dense Object Matching

Add code
Aug 06, 2025
Viaarxiv icon

CoST: Efficient Collaborative Perception From Unified Spatiotemporal Perspective

Add code
Aug 01, 2025
Viaarxiv icon

OctoNav: Towards Generalist Embodied Navigation

Add code
Jun 11, 2025
Viaarxiv icon

RoboCerebra: A Large-scale Benchmark for Long-horizon Robotic Manipulation Evaluation

Add code
Jun 07, 2025
Viaarxiv icon

UAV-Flow Colosseo: A Real-World Benchmark for Flying-on-a-Word UAV Imitation Learning

Add code
May 21, 2025
Viaarxiv icon

ProFashion: Prototype-guided Fashion Video Generation with Multiple Reference Images

Add code
May 10, 2025
Viaarxiv icon

EvMic: Event-based Non-contact sound recovery from effective spatial-temporal modeling

Add code
Apr 03, 2025
Viaarxiv icon

LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models

Add code
Mar 27, 2025
Figure 1 for LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models
Figure 2 for LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models
Figure 3 for LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models
Figure 4 for LLaVA-CMoE: Towards Continual Mixture of Experts for Large Vision-Language Models
Viaarxiv icon

Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs

Add code
Mar 26, 2025
Figure 1 for Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs
Figure 2 for Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs
Figure 3 for Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs
Figure 4 for Instruction-Oriented Preference Alignment for Enhancing Multi-Modal Comprehension Capability of MLLMs
Viaarxiv icon