Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation



Sravya Popuri , Peng-Jen Chen , Changhan Wang , Juan Pino , Yossi Adi , Jiatao Gu , Wei-Ning Hsu , Ann Lee

* Submitted to Interspeech 2022 

   Access Paper or Ask Questions

textless-lib: a Library for Textless Spoken Language Processing



Eugene Kharitonov , Jade Copet , Kushal Lakhotia , Tu Anh Nguyen , Paden Tomasello , Ann Lee , Ali Elkahky , Wei-Ning Hsu , Abdelrahman Mohamed , Emmanuel Dupoux , Yossi Adi

* The library is available here https://github.com/facebookresearch/textlesslib/ 

   Access Paper or Ask Questions

Flashlight: Enabling Innovation in Tools for Machine Learning



Jacob Kahn , Vineel Pratap , Tatiana Likhomanenko , Qiantong Xu , Awni Hannun , Jeff Cai , Paden Tomasello , Ann Lee , Edouard Grave , Gilad Avidov , Benoit Steiner , Vitaliy Liptchinsky , Gabriel Synnaeve , Ronan Collobert


   Access Paper or Ask Questions

Textless Speech-to-Speech Translation on Real Data



Ann Lee , Hongyu Gong , Paul-Ambroise Duquenne , Holger Schwenk , Peng-Jen Chen , Changhan Wang , Sravya Popuri , Juan Pino , Jiatao Gu , Wei-Ning Hsu


   Access Paper or Ask Questions

Direct simultaneous speech to speech translation



Xutai Ma , Hongyu Gong , Danni Liu , Ann Lee , Yun Tang , Peng-Jen Chen , Wei-Ning Hsu , Kenneth Heafield , Phillip Koehn , Juan Pino


   Access Paper or Ask Questions

fairseq S^2: A Scalable and Integrable Speech Synthesis Toolkit



Changhan Wang , Wei-Ning Hsu , Yossi Adi , Adam Polyak , Ann Lee , Peng-Jen Chen , Jiatao Gu , Juan Pino

* Accepted to EMNLP 2021 Demo 

   Access Paper or Ask Questions

Text-Free Prosody-Aware Generative Spoken Language Modeling



Eugene Kharitonov , Ann Lee , Adam Polyak , Yossi Adi , Jade Copet , Kushal Lakhotia , Tu-Anh Nguyen , Morgane Rivière , Abdelrahman Mohamed , Emmanuel Dupoux , Wei-Ning Hsu


   Access Paper or Ask Questions

Direct speech-to-speech translation with discrete units



Ann Lee , Peng-Jen Chen , Changhan Wang , Jiatao Gu , Xutai Ma , Adam Polyak , Yossi Adi , Qing He , Yun Tang , Juan Pino , Wei-Ning Hsu


   Access Paper or Ask Questions

Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training



Wei-Ning Hsu , Anuroop Sriram , Alexei Baevski , Tatiana Likhomanenko , Qiantong Xu , Vineel Pratap , Jacob Kahn , Ann Lee , Ronan Collobert , Gabriel Synnaeve , Michael Auli


   Access Paper or Ask Questions

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation



Changhan Wang , Morgane Rivière , Ann Lee , Anne Wu , Chaitanya Talnikar , Daniel Haziza , Mary Williamson , Juan Pino , Emmanuel Dupoux


   Access Paper or Ask Questions

1
2
>>