Picture for Mirco Ravanelli

Mirco Ravanelli

Beyond Fixed Frames: Dynamic Character-Aligned Speech Tokenization

Add code
Jan 30, 2026
Viaarxiv icon

Toward Faithful Explanations in Acoustic Anomaly Detection

Add code
Jan 19, 2026
Viaarxiv icon

Comparison of Speech Tasks in Human Expert and Machine Detection of Parkinson's Disease

Add code
Oct 08, 2025
Viaarxiv icon

Investigating Faithfulness in Large Audio Language Models

Add code
Sep 26, 2025
Viaarxiv icon

FocalCodec-Stream: Streaming Low-Bitrate Speech Coding via Causal Distillation

Add code
Sep 19, 2025
Viaarxiv icon

Audio Prototypical Network For Controllable Music Recommendation

Add code
Jul 31, 2025
Viaarxiv icon

Discrete Audio Tokens: More Than a Survey!

Add code
Jun 12, 2025
Viaarxiv icon

ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs

Add code
May 26, 2025
Viaarxiv icon

LiSTEN: Learning Soft Token Embeddings for Neural Audio LLMs

Add code
May 24, 2025
Viaarxiv icon

Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down

Add code
May 19, 2025
Viaarxiv icon