Wei-Ning Hsu

Towards End-to-end Unsupervised Speech Recognition

Apr 05, 2022
Alexander H. Liu, Wei-Ning Hsu, Michael Auli, Alexei Baevski

Generative Spoken Dialogue Language Modeling

Mar 30, 2022
Tu Anh Nguyen, Eugene Kharitonov, Jade Copet, Yossi Adi, Wei-Ning Hsu, Ali Elkahky, Paden Tomasello, Robin Algayres, Benoit Sagot, Abdelrahman Mohamed, Emmanuel Dupoux

Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training

Mar 02, 2022
Ramon Sanabria, Wei-Ning Hsu, Alexei Baevski, Michael Auli

textless-lib: a Library for Textless Spoken Language Processing

Feb 15, 2022
Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi

data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

Feb 07, 2022
Alexei Baevski, Wei-Ning Hsu, Qiantong Xu, Arun Babu, Jiatao Gu, Michael Auli

Robust Self-Supervised Audio-Visual Speech Recognition

Jan 05, 2022
Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed

Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction

Jan 05, 2022
Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed

Textless Speech-to-Speech Translation on Real Data

Dec 15, 2021
Ann Lee, Hongyu Gong, Paul-Ambroise Duquenne, Holger Schwenk, Peng-Jen Chen, Changhan Wang, Sravya Popuri, Juan Pino, Jiatao Gu, Wei-Ning Hsu

Textless Speech Emotion Conversion using Decomposed and Discrete Representations

Nov 14, 2021
Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu-Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi

Direct simultaneous speech to speech translation

Oct 15, 2021
Xutai Ma, Hongyu Gong, Danni Liu, Ann Lee, Yun Tang, Peng-Jen Chen, Wei-Ning Hsu, Kenneth Heafield, Philipp Koehn, Juan Pino
