Xiaobao Wei

GraspFoM: Towards Reconstruction-Driven Robotic Grasping with 3D Foundation Priors

Jun 07, 2026

SparseStreet: Sparse Gaussian Splatting for Real-Time Street Scene Simulation

Jun 02, 2026

Feed-Forward Gaussian Splatting from Sparse Aerial Views

May 19, 2026

VEGA: Visual Encoder Grounding Alignment for Spatially-Aware Vision-Language-Action Models

May 11, 2026

PhyMix: Towards Physically Consistent Single-Image 3D Indoor Scene Generation with Implicit-Explicit Optimization

Apr 11, 2026

EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning Distillation

Mar 10, 2026

RSATalker: Realistic Socially-Aware Talking Head Generation for Multi-Turn Conversation

Jan 15, 2026

ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking

Jan 04, 2026

PIGEON: VLM-Driven Object Navigation via Points of Interest Selection

Nov 17, 2025

Rethinking Driving World Model as Synthetic Data Generator for Perception Tasks

Oct 22, 2025