Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities



Hsiang-Sheng Tsai , Heng-Jui Chang , Wen-Chin Huang , Zili Huang , Kushal Lakhotia , Shu-wen Yang , Shuyan Dong , Andy T. Liu , Cheng-I Jeff Lai , Jiatong Shi , Xuankai Chang , Phil Hall , Hsuan-Jui Chen , Shang-Wen Li , Shinji Watanabe , Abdelrahman Mohamed , Hung-yi Lee

* ACL 2022 main conference 

   Access Paper or Ask Questions

textless-lib: a Library for Textless Spoken Language Processing



Eugene Kharitonov , Jade Copet , Kushal Lakhotia , Tu Anh Nguyen , Paden Tomasello , Ann Lee , Ali Elkahky , Wei-Ning Hsu , Abdelrahman Mohamed , Emmanuel Dupoux , Yossi Adi

* The library is available here https://github.com/facebookresearch/textlesslib/ 

   Access Paper or Ask Questions

Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction



Bowen Shi , Wei-Ning Hsu , Kushal Lakhotia , Abdelrahman Mohamed


   Access Paper or Ask Questions

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale



Arun Babu , Changhan Wang , Andros Tjandra , Kushal Lakhotia , Qiantong Xu , Naman Goyal , Kritika Singh , Patrick von Platen , Yatharth Saraf , Juan Pino , Alexei Baevski , Alexis Conneau , Michael Auli


   Access Paper or Ask Questions

Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?



Xilun Chen , Kushal Lakhotia , Barlas Oğuz , Anchit Gupta , Patrick Lewis , Stan Peshterliev , Yashar Mehdad , Sonal Gupta , Wen-tau Yih


   Access Paper or Ask Questions

Text-Free Prosody-Aware Generative Spoken Language Modeling



Eugene Kharitonov , Ann Lee , Adam Polyak , Yossi Adi , Jade Copet , Kushal Lakhotia , Tu-Anh Nguyen , Morgane Rivière , Abdelrahman Mohamed , Emmanuel Dupoux , Wei-Ning Hsu


   Access Paper or Ask Questions

Domain-matched Pre-training Tasks for Dense Retrieval



Barlas Oğuz , Kushal Lakhotia , Anchit Gupta , Patrick Lewis , Vladimir Karpukhin , Aleksandra Piktus , Xilun Chen , Sebastian Riedel , Wen-tau Yih , Sonal Gupta , Yashar Mehdad


   Access Paper or Ask Questions

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units



Wei-Ning Hsu , Benjamin Bolte , Yao-Hung Hubert Tsai , Kushal Lakhotia , Ruslan Salakhutdinov , Abdelrahman Mohamed


   Access Paper or Ask Questions

SUPERB: Speech processing Universal PERformance Benchmark



Shu-wen Yang , Po-Han Chi , Yung-Sung Chuang , Cheng-I Jeff Lai , Kushal Lakhotia , Yist Y. Lin , Andy T. Liu , Jiatong Shi , Xuankai Chang , Guan-Ting Lin , Tzu-Hsien Huang , Wei-Cheng Tseng , Ko-tik Lee , Da-Rong Liu , Zili Huang , Shuyan Dong , Shang-Wen Li , Shinji Watanabe , Abdelrahman Mohamed , Hung-yi Lee

* Submitted to Interspeech 2021 

   Access Paper or Ask Questions

1
2
>>