Picture for Xiaosong Jia

Xiaosong Jia

EvoMemNav: Efficient Self-Evolving Fine-Grained Memory for Zero-Shot Embodied Navigation

Add code
Jun 02, 2026
Viaarxiv icon

Resolving Representation Ambiguity in Feedforward Novel View Synthesis Transformer via Semantic-Spatial Decoupling

Add code
May 18, 2026
Viaarxiv icon

GuidedVLA: Specifying Task-Relevant Factors via Plug-and-Play Action Attention Specialization

Add code
May 12, 2026
Viaarxiv icon

SWIFT: Prompt-Adaptive Memory for Efficient Interactive Long Video Generation

Add code
May 10, 2026
Viaarxiv icon

Attention Itself Could Retrieve.RetrieveVGGT: Training-Free Long Context Streaming 3D Reconstruction via Query-Key Similarity Retrieval

Add code
May 10, 2026
Viaarxiv icon

Bench2Drive-VL: Benchmarks for Closed-Loop Autonomous Driving with Vision-Language Models

Add code
Apr 01, 2026
Viaarxiv icon

Can Users Specify Driving Speed? Bench2Drive-Speed: Benchmark and Baselines for Desired-Speed Conditioned Autonomous Driving

Add code
Mar 26, 2026
Viaarxiv icon

ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments

Add code
Mar 03, 2026
Viaarxiv icon

PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models

Add code
Feb 28, 2026
Viaarxiv icon

Efficient-LVSM: Faster, Cheaper, and Better Large View Synthesis Model via Decoupled Co-Refinement Attention

Add code
Feb 06, 2026
Viaarxiv icon