Insung Lee

Omni-Embed-Audio: Leveraging Multimodal LLMs for Robust Audio-Text Retrieval

Apr 20, 2026
Via arXiv

CAF-Score: Calibrating CLAP with LALMs for Reference-free Audio Captioning Evaluation

Mar 20, 2026
Via arXiv