Picture for Neil Zeghidour

Neil Zeghidour

PSL, FAIR, LSCP

Adaptive Test-Time Scaling for Zero-Shot Respiratory Audio Classification

Add code
Apr 14, 2026
Viaarxiv icon

MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models

Add code
Apr 14, 2026
Viaarxiv icon

Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling

Add code
Sep 10, 2025
Viaarxiv icon

Continuous Audio Language Models

Add code
Sep 09, 2025
Viaarxiv icon

Aligning Spoken Dialogue Models from User Interactions

Add code
Jun 26, 2025
Viaarxiv icon

CaReAQA: A Cardiac and Respiratory Audio Question Answering Model for Open-Ended Diagnostic Reasoning

Add code
May 02, 2025
Viaarxiv icon

Vision-Speech Models: Teaching Speech Models to Converse about Images

Add code
Mar 19, 2025
Viaarxiv icon

High-Fidelity Simultaneous Speech-To-Speech Translation

Add code
Feb 05, 2025
Figure 1 for High-Fidelity Simultaneous Speech-To-Speech Translation
Figure 2 for High-Fidelity Simultaneous Speech-To-Speech Translation
Figure 3 for High-Fidelity Simultaneous Speech-To-Speech Translation
Figure 4 for High-Fidelity Simultaneous Speech-To-Speech Translation
Viaarxiv icon

MAD Speech: Measures of Acoustic Diversity of Speech

Add code
Apr 16, 2024
Figure 1 for MAD Speech: Measures of Acoustic Diversity of Speech
Figure 2 for MAD Speech: Measures of Acoustic Diversity of Speech
Figure 3 for MAD Speech: Measures of Acoustic Diversity of Speech
Figure 4 for MAD Speech: Measures of Acoustic Diversity of Speech
Viaarxiv icon

MusicRL: Aligning Music Generation to Human Preferences

Add code
Feb 06, 2024
Viaarxiv icon