Picture for Neil Zeghidour

Neil Zeghidour

PSL, FAIR, LSCP

Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling

Add code
Sep 10, 2025
Viaarxiv icon

Continuous Audio Language Models

Add code
Sep 09, 2025
Viaarxiv icon

Aligning Spoken Dialogue Models from User Interactions

Add code
Jun 26, 2025
Viaarxiv icon

CaReAQA: A Cardiac and Respiratory Audio Question Answering Model for Open-Ended Diagnostic Reasoning

Add code
May 02, 2025
Viaarxiv icon

Vision-Speech Models: Teaching Speech Models to Converse about Images

Add code
Mar 19, 2025
Viaarxiv icon

High-Fidelity Simultaneous Speech-To-Speech Translation

Add code
Feb 05, 2025
Viaarxiv icon

MAD Speech: Measures of Acoustic Diversity of Speech

Add code
Apr 16, 2024
Figure 1 for MAD Speech: Measures of Acoustic Diversity of Speech
Figure 2 for MAD Speech: Measures of Acoustic Diversity of Speech
Figure 3 for MAD Speech: Measures of Acoustic Diversity of Speech
Figure 4 for MAD Speech: Measures of Acoustic Diversity of Speech
Viaarxiv icon

MusicRL: Aligning Music Generation to Human Preferences

Add code
Feb 06, 2024
Viaarxiv icon

TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition

Add code
Aug 21, 2023
Figure 1 for TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
Figure 2 for TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
Figure 3 for TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition
Viaarxiv icon

AudioPaLM: A Large Language Model That Can Speak and Listen

Add code
Jun 22, 2023
Figure 1 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 2 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 3 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 4 for AudioPaLM: A Large Language Model That Can Speak and Listen
Viaarxiv icon