Alert button
Picture for Yossi Adi

Yossi Adi

Alert button

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

Add code
Bookmark button
Alert button
Jun 23, 2023
Matthew Le, Apoorv Vyas, Bowen Shi, Brian Karrer, Leda Sari, Rashel Moritz, Mary Williamson, Vimal Manohar, Yossi Adi, Jay Mahadeokar, Wei-Ning Hsu

Figure 1 for Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Figure 2 for Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Figure 3 for Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Figure 4 for Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
Viaarxiv icon

Simple and Controllable Music Generation

Add code
Bookmark button
Alert button
Jun 08, 2023
Jade Copet, Felix Kreuk, Itai Gat, Tal Remez, David Kant, Gabriel Synnaeve, Yossi Adi, Alexandre Défossez

Figure 1 for Simple and Controllable Music Generation
Figure 2 for Simple and Controllable Music Generation
Figure 3 for Simple and Controllable Music Generation
Figure 4 for Simple and Controllable Music Generation
Viaarxiv icon

Scaling Speech Technology to 1,000+ Languages

Add code
Bookmark button
Alert button
May 22, 2023
Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli

Figure 1 for Scaling Speech Technology to 1,000+ Languages
Figure 2 for Scaling Speech Technology to 1,000+ Languages
Figure 3 for Scaling Speech Technology to 1,000+ Languages
Figure 4 for Scaling Speech Technology to 1,000+ Languages
Viaarxiv icon

AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

Add code
Bookmark button
Alert button
May 22, 2023
Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi, Idan Schwartz

Figure 1 for AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Figure 2 for AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Figure 3 for AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Figure 4 for AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Viaarxiv icon

Textually Pretrained Speech Language Models

Add code
Bookmark button
Alert button
May 22, 2023
Michael Hassid, Tal Remez, Tu Anh Nguyen, Itai Gat, Alexis Conneau, Felix Kreuk, Jade Copet, Alexandre Defossez, Gabriel Synnaeve, Emmanuel Dupoux, Roy Schwartz, Yossi Adi

Figure 1 for Textually Pretrained Speech Language Models
Figure 2 for Textually Pretrained Speech Language Models
Figure 3 for Textually Pretrained Speech Language Models
Figure 4 for Textually Pretrained Speech Language Models
Viaarxiv icon

Layer Collaboration in the Forward-Forward Algorithm

Add code
Bookmark button
Alert button
May 21, 2023
Guy Lorberbom, Itai Gat, Yossi Adi, Alex Schwing, Tamir Hazan

Figure 1 for Layer Collaboration in the Forward-Forward Algorithm
Figure 2 for Layer Collaboration in the Forward-Forward Algorithm
Figure 3 for Layer Collaboration in the Forward-Forward Algorithm
Figure 4 for Layer Collaboration in the Forward-Forward Algorithm
Viaarxiv icon

A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation

Add code
Bookmark button
Alert button
Jan 25, 2023
Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen

Figure 1 for A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation
Figure 2 for A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation
Figure 3 for A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation
Figure 4 for A Holistic Cascade System, benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation
Viaarxiv icon

Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling

Add code
Bookmark button
Alert button
Jan 02, 2023
Amitay Sicherman, Yossi Adi

Figure 1 for Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling
Figure 2 for Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling
Figure 3 for Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling
Figure 4 for Analysing Discrete Self Supervised Speech Representation for Spoken Language Modeling
Viaarxiv icon

ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement

Add code
Bookmark button
Alert button
Dec 21, 2022
Wei-Ning Hsu, Tal Remez, Bowen Shi, Jacob Donley, Yossi Adi

Figure 1 for ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Figure 2 for ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Figure 3 for ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Figure 4 for ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement
Viaarxiv icon

Speaking Style Conversion With Discrete Self-Supervised Units

Add code
Bookmark button
Alert button
Dec 19, 2022
Gallil Maimon, Yossi Adi

Figure 1 for Speaking Style Conversion With Discrete Self-Supervised Units
Figure 2 for Speaking Style Conversion With Discrete Self-Supervised Units
Figure 3 for Speaking Style Conversion With Discrete Self-Supervised Units
Figure 4 for Speaking Style Conversion With Discrete Self-Supervised Units
Viaarxiv icon