Picture for Qiaolin Wang

Qiaolin Wang

SightSound-R1: Cross-Modal Reasoning Distillation from Vision to Audio Language Models

Add code
Sep 19, 2025
Viaarxiv icon

Layer-wise Minimal Pair Probing Reveals Contextual Grammatical-Conceptual Hierarchy in Speech Representations

Add code
Sep 19, 2025
Viaarxiv icon