Houhua Zhang

Spatial-Omni: Spatial Audio Understanding Integration in Multimodal LLMs via FOA Encoding

Jun 09, 2026
Via arXiv