Picture for Eugene Kharitonov

Eugene Kharitonov

Dima

MoshiRAG: Asynchronous Knowledge Retrieval for Full-Duplex Speech Language Models

Add code
Apr 14, 2026
Viaarxiv icon

Streaming Sequence-to-Sequence Learning with Delayed Streams Modeling

Add code
Sep 10, 2025
Viaarxiv icon

Gemma 3 Technical Report

Add code
Mar 25, 2025
Viaarxiv icon

MAD Speech: Measures of Acoustic Diversity of Speech

Add code
Apr 16, 2024
Figure 1 for MAD Speech: Measures of Acoustic Diversity of Speech
Figure 2 for MAD Speech: Measures of Acoustic Diversity of Speech
Figure 3 for MAD Speech: Measures of Acoustic Diversity of Speech
Figure 4 for MAD Speech: Measures of Acoustic Diversity of Speech
Viaarxiv icon

AudioPaLM: A Large Language Model That Can Speak and Listen

Add code
Jun 22, 2023
Figure 1 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 2 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 3 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 4 for AudioPaLM: A Large Language Model That Can Speak and Listen
Viaarxiv icon

SoundStorm: Efficient Parallel Audio Generation

Add code
May 16, 2023
Viaarxiv icon

Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision

Add code
Feb 07, 2023
Viaarxiv icon

AudioLM: a Language Modeling Approach to Audio Generation

Add code
Sep 07, 2022
Figure 1 for AudioLM: a Language Modeling Approach to Audio Generation
Figure 2 for AudioLM: a Language Modeling Approach to Audio Generation
Figure 3 for AudioLM: a Language Modeling Approach to Audio Generation
Figure 4 for AudioLM: a Language Modeling Approach to Audio Generation
Viaarxiv icon

Generative Spoken Dialogue Language Modeling

Add code
Mar 30, 2022
Figure 1 for Generative Spoken Dialogue Language Modeling
Figure 2 for Generative Spoken Dialogue Language Modeling
Figure 3 for Generative Spoken Dialogue Language Modeling
Figure 4 for Generative Spoken Dialogue Language Modeling
Viaarxiv icon

textless-lib: a Library for Textless Spoken Language Processing

Add code
Feb 15, 2022
Figure 1 for textless-lib: a Library for Textless Spoken Language Processing
Figure 2 for textless-lib: a Library for Textless Spoken Language Processing
Figure 3 for textless-lib: a Library for Textless Spoken Language Processing
Figure 4 for textless-lib: a Library for Textless Spoken Language Processing
Viaarxiv icon