Xiaosong Jia

ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments

Mar 03, 2026
Via arXiv

PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models

Feb 28, 2026
Via arXiv

Efficient-LVSM: Faster, Cheaper, and Better Large View Synthesis Model via Decoupled Co-Refinement Attention

Feb 06, 2026
Via arXiv

Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank

Dec 13, 2025
Via arXiv

ReSim: Reliable World Simulation for Autonomous Driving

Jun 11, 2025
Via arXiv

Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)

May 22, 2025
Via arXiv

DriveMoE: Mixture-of-Experts for Vision-Language-Action Model in End-to-End Autonomous Driving

May 22, 2025
Via arXiv

Interleave-VLA: Enhancing Robot Manipulation with Interleaved Image-Text Instructions

May 04, 2025
Via arXiv

DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving

Mar 07, 2025
Via arXiv

PointOBB-v3: Expanding Performance Boundaries of Single Point-Supervised Oriented Object Detection

Jan 23, 2025
Via arXiv