Picture for Eugene Kharitonov

Eugene Kharitonov

MAD Speech: Measures of Acoustic Diversity of Speech

Add code
Apr 16, 2024
Viaarxiv icon

AudioPaLM: A Large Language Model That Can Speak and Listen

Add code
Jun 22, 2023
Figure 1 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 2 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 3 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 4 for AudioPaLM: A Large Language Model That Can Speak and Listen
Viaarxiv icon

SoundStorm: Efficient Parallel Audio Generation

Add code
May 16, 2023
Figure 1 for SoundStorm: Efficient Parallel Audio Generation
Figure 2 for SoundStorm: Efficient Parallel Audio Generation
Figure 3 for SoundStorm: Efficient Parallel Audio Generation
Figure 4 for SoundStorm: Efficient Parallel Audio Generation
Viaarxiv icon

Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision

Add code
Feb 07, 2023
Figure 1 for Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
Figure 2 for Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
Figure 3 for Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
Figure 4 for Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
Viaarxiv icon

AudioLM: a Language Modeling Approach to Audio Generation

Add code
Sep 07, 2022
Figure 1 for AudioLM: a Language Modeling Approach to Audio Generation
Figure 2 for AudioLM: a Language Modeling Approach to Audio Generation
Figure 3 for AudioLM: a Language Modeling Approach to Audio Generation
Figure 4 for AudioLM: a Language Modeling Approach to Audio Generation
Viaarxiv icon

Generative Spoken Dialogue Language Modeling

Add code
Mar 30, 2022
Figure 1 for Generative Spoken Dialogue Language Modeling
Figure 2 for Generative Spoken Dialogue Language Modeling
Figure 3 for Generative Spoken Dialogue Language Modeling
Figure 4 for Generative Spoken Dialogue Language Modeling
Viaarxiv icon

textless-lib: a Library for Textless Spoken Language Processing

Add code
Feb 15, 2022
Figure 1 for textless-lib: a Library for Textless Spoken Language Processing
Figure 2 for textless-lib: a Library for Textless Spoken Language Processing
Figure 3 for textless-lib: a Library for Textless Spoken Language Processing
Figure 4 for textless-lib: a Library for Textless Spoken Language Processing
Viaarxiv icon

Textless Speech Emotion Conversion using Decomposed and Discrete Representations

Add code
Nov 14, 2021
Figure 1 for Textless Speech Emotion Conversion using Decomposed and Discrete Representations
Figure 2 for Textless Speech Emotion Conversion using Decomposed and Discrete Representations
Figure 3 for Textless Speech Emotion Conversion using Decomposed and Discrete Representations
Figure 4 for Textless Speech Emotion Conversion using Decomposed and Discrete Representations
Viaarxiv icon

How BPE Affects Memorization in Transformers

Add code
Oct 06, 2021
Figure 1 for How BPE Affects Memorization in Transformers
Figure 2 for How BPE Affects Memorization in Transformers
Figure 3 for How BPE Affects Memorization in Transformers
Figure 4 for How BPE Affects Memorization in Transformers
Viaarxiv icon

Text-Free Prosody-Aware Generative Spoken Language Modeling

Add code
Sep 07, 2021
Figure 1 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Figure 2 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Figure 3 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Figure 4 for Text-Free Prosody-Aware Generative Spoken Language Modeling
Viaarxiv icon