Picture for Sifan Tu

Sifan Tu

ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding

Add code
Feb 26, 2026
Viaarxiv icon

Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception

Add code
Mar 17, 2025
Figure 1 for Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
Figure 2 for Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
Figure 3 for Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
Figure 4 for Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception
Viaarxiv icon

HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation

Add code
Jan 24, 2025
Figure 1 for HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
Figure 2 for HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
Figure 3 for HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
Figure 4 for HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation
Viaarxiv icon

A Unified Framework for 3D Scene Understanding

Add code
Jul 03, 2024
Figure 1 for A Unified Framework for 3D Scene Understanding
Figure 2 for A Unified Framework for 3D Scene Understanding
Figure 3 for A Unified Framework for 3D Scene Understanding
Figure 4 for A Unified Framework for 3D Scene Understanding
Viaarxiv icon