Alert button
Picture for Matt Sharifi

Matt Sharifi

Alert button

Social Learning: Towards Collaborative Learning with Large Language Models

Add code
Bookmark button
Alert button
Dec 18, 2023
Amirkeivan Mohtashami, Florian Hartmann, Sian Gooding, Lukas Zilka, Matt Sharifi, Blaise Aguera y Arcas

Viaarxiv icon

AudioPaLM: A Large Language Model That Can Speak and Listen

Add code
Bookmark button
Alert button
Jun 22, 2023
Paul K. Rubenstein, Chulayuth Asawaroengchai, Duc Dung Nguyen, Ankur Bapna, Zalán Borsos, Félix de Chaumont Quitry, Peter Chen, Dalia El Badawy, Wei Han, Eugene Kharitonov, Hannah Muckenhirn, Dirk Padfield, James Qin, Danny Rozenberg, Tara Sainath, Johan Schalkwyk, Matt Sharifi, Michelle Tadmor Ramanovich, Marco Tagliasacchi, Alexandru Tudor, Mihajlo Velimirović, Damien Vincent, Jiahui Yu, Yongqiang Wang, Vicky Zayats, Neil Zeghidour, Yu Zhang, Zhishuai Zhang, Lukas Zilka, Christian Frank

Figure 1 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 2 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 3 for AudioPaLM: A Large Language Model That Can Speak and Listen
Figure 4 for AudioPaLM: A Large Language Model That Can Speak and Listen
Viaarxiv icon

SoundStorm: Efficient Parallel Audio Generation

Add code
Bookmark button
Alert button
May 16, 2023
Zalán Borsos, Matt Sharifi, Damien Vincent, Eugene Kharitonov, Neil Zeghidour, Marco Tagliasacchi

Figure 1 for SoundStorm: Efficient Parallel Audio Generation
Figure 2 for SoundStorm: Efficient Parallel Audio Generation
Figure 3 for SoundStorm: Efficient Parallel Audio Generation
Figure 4 for SoundStorm: Efficient Parallel Audio Generation
Viaarxiv icon

Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision

Add code
Bookmark button
Alert button
Feb 07, 2023
Eugene Kharitonov, Damien Vincent, Zalán Borsos, Raphaël Marinier, Sertan Girgin, Olivier Pietquin, Matt Sharifi, Marco Tagliasacchi, Neil Zeghidour

Figure 1 for Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
Figure 2 for Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
Figure 3 for Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
Figure 4 for Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision
Viaarxiv icon

MusicLM: Generating Music From Text

Add code
Bookmark button
Alert button
Jan 26, 2023
Andrea Agostinelli, Timo I. Denk, Zalán Borsos, Jesse Engel, Mauro Verzetti, Antoine Caillon, Qingqing Huang, Aren Jansen, Adam Roberts, Marco Tagliasacchi, Matt Sharifi, Neil Zeghidour, Christian Frank

Figure 1 for MusicLM: Generating Music From Text
Figure 2 for MusicLM: Generating Music From Text
Figure 3 for MusicLM: Generating Music From Text
Figure 4 for MusicLM: Generating Music From Text
Viaarxiv icon

AudioLM: a Language Modeling Approach to Audio Generation

Add code
Bookmark button
Alert button
Sep 07, 2022
Zalán Borsos, Raphaël Marinier, Damien Vincent, Eugene Kharitonov, Olivier Pietquin, Matt Sharifi, Olivier Teboul, David Grangier, Marco Tagliasacchi, Neil Zeghidour

Figure 1 for AudioLM: a Language Modeling Approach to Audio Generation
Figure 2 for AudioLM: a Language Modeling Approach to Audio Generation
Figure 3 for AudioLM: a Language Modeling Approach to Audio Generation
Figure 4 for AudioLM: a Language Modeling Approach to Audio Generation
Viaarxiv icon

SpeechPainter: Text-conditioned Speech Inpainting

Add code
Bookmark button
Alert button
Feb 15, 2022
Zalán Borsos, Matt Sharifi, Marco Tagliasacchi

Figure 1 for SpeechPainter: Text-conditioned Speech Inpainting
Figure 2 for SpeechPainter: Text-conditioned Speech Inpainting
Figure 3 for SpeechPainter: Text-conditioned Speech Inpainting
Viaarxiv icon

Predicting Text Readability from Scrolling Interactions

Add code
Bookmark button
Alert button
May 13, 2021
Sian Gooding, Yevgeni Berzak, Tony Mak, Matt Sharifi

Figure 1 for Predicting Text Readability from Scrolling Interactions
Figure 2 for Predicting Text Readability from Scrolling Interactions
Figure 3 for Predicting Text Readability from Scrolling Interactions
Figure 4 for Predicting Text Readability from Scrolling Interactions
Viaarxiv icon

SPICE: Self-supervised Pitch Estimation

Add code
Bookmark button
Alert button
Oct 25, 2019
Beat Gfeller, Christian Frank, Dominik Roblek, Matt Sharifi, Marco Tagliasacchi, Mihajlo Velimirović

Figure 1 for SPICE: Self-supervised Pitch Estimation
Figure 2 for SPICE: Self-supervised Pitch Estimation
Figure 3 for SPICE: Self-supervised Pitch Estimation
Figure 4 for SPICE: Self-supervised Pitch Estimation
Viaarxiv icon