Picture for Xiaojuan Qi

Xiaojuan Qi

See, Remember, Explore: A Benchmark and Baselines for Streaming Spatial Reasoning

Add code
Mar 25, 2026
Viaarxiv icon

LiFR-Seg: Anytime High-Frame-Rate Segmentation via Event-Guided Propagation

Add code
Mar 22, 2026
Viaarxiv icon

Stereo World Model: Camera-Guided Stereo Video Generation

Add code
Mar 18, 2026
Viaarxiv icon

Stable Velocity: A Variance Perspective on Flow Matching

Add code
Feb 05, 2026
Viaarxiv icon

Self-Evaluation Unlocks Any-Step Text-to-Image Generation

Add code
Dec 26, 2025
Viaarxiv icon

Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models

Add code
Dec 23, 2025
Viaarxiv icon

ASSIST-3D: Adapted Scene Synthesis for Class-Agnostic 3D Instance Segmentation

Add code
Dec 10, 2025
Viaarxiv icon

Efficient lattice field theory simulation using adaptive normalizing flow on a resistive memory-based neural differential equation solver

Add code
Sep 16, 2025
Viaarxiv icon

Understanding Data Influence with Differential Approximation

Add code
Aug 20, 2025
Viaarxiv icon

NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding

Add code
Aug 20, 2025
Figure 1 for NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding
Figure 2 for NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding
Figure 3 for NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding
Figure 4 for NoteIt: A System Converting Instructional Videos to Interactable Notes Through Multimodal Video Understanding
Viaarxiv icon