Xiuwei Xu

CMP: Robust Whole-Body Tracking for Loco-Manipulation via Competence Manifold Projection

Apr 08, 2026

F2F-AP: Flow-to-Future Asynchronous Policy for Real-time Dynamic Manipulation

Apr 02, 2026

iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion

Nov 18, 2025

Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline

Aug 06, 2025

IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation

Aug 01, 2025

EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models

Mar 19, 2025

MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation

Mar 17, 2025

UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Mar 13, 2025

Q-VLM: Post-training Quantization for Large Vision-Language Models

Oct 10, 2024

SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation

Oct 10, 2024