Picture for Longbiao Wang

Longbiao Wang

Separate First, Fuse Later: Mitigating Cross-Modal Interference in Audio-Visual LLMs Reasoning with Modality-Specific Chain-of-Thought

Add code
May 11, 2026
Viaarxiv icon

Evaluating the Expressive Appropriateness of Speech in Rich Contexts

Add code
May 10, 2026
Viaarxiv icon

Compositional Multi-hop Factual Error Correction via Decomposition-and-Injection

Add code
May 04, 2026
Viaarxiv icon

UniSonate: A Unified Model for Speech, Music, and Sound Effect Generation with Text Instructions

Add code
Apr 24, 2026
Viaarxiv icon

MSR-HuBERT: Self-supervised Pre-training for Adaptation to Multiple Sampling Rates

Add code
Mar 24, 2026
Viaarxiv icon

Breaking Data Efficiency Dilemma: A Federated and Augmented Learning Framework For Alzheimer's Disease Detection via Speech

Add code
Feb 16, 2026
Viaarxiv icon

POTSA: A Cross-Lingual Speech Alignment Framework for Low Resource Speech-to-Text Translation

Add code
Nov 12, 2025
Viaarxiv icon

ASDA: Audio Spectrogram Differential Attention Mechanism for Self-Supervised Representation Learning

Add code
Jul 03, 2025
Figure 1 for ASDA: Audio Spectrogram Differential Attention Mechanism for Self-Supervised Representation Learning
Figure 2 for ASDA: Audio Spectrogram Differential Attention Mechanism for Self-Supervised Representation Learning
Figure 3 for ASDA: Audio Spectrogram Differential Attention Mechanism for Self-Supervised Representation Learning
Figure 4 for ASDA: Audio Spectrogram Differential Attention Mechanism for Self-Supervised Representation Learning
Viaarxiv icon

Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting Framework

Add code
Jun 10, 2025
Viaarxiv icon

Rethinking Contrastive Learning in Graph Anomaly Detection: A Clean-View Perspective

Add code
May 23, 2025
Viaarxiv icon