Picture for Siyu Xie

Siyu Xie

Audio-VLA: Adding Contact Audio Perception to Vision-Language-Action Model for Robotic Manipulation

Add code
Nov 13, 2025
Viaarxiv icon