Hsiao-Ying Huang

Is CLIP Cross-Eyed? Revealing and Mitigating Center Bias in the CLIP Family

Apr 07, 2026
Via arXiv

SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models

Mar 10, 2026
Via arXiv

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

Jul 03, 2025
Via arXiv

Revisiting Test-Time Scaling: A Survey and a Diversity-Aware Method for Efficient Reasoning

Jun 05, 2025
Via arXiv