Picture for Shihan Wu

Shihan Wu

InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning

Add code
May 20, 2025
Viaarxiv icon

Policy Contrastive Decoding for Robotic Foundation Models

Add code
May 19, 2025
Viaarxiv icon

Skip Tuning: Pre-trained Vision-Language Models are Effective and Efficient Adapters Themselves

Add code
Dec 16, 2024
Viaarxiv icon

DePT: Decoupled Prompt Tuning

Add code
Sep 14, 2023
Viaarxiv icon