Kushal Lakhotia

SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities

Mar 14, 2022
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee

textless-lib: a Library for Textless Spoken Language Processing

Feb 15, 2022
Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi

Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction

Jan 05, 2022
Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed

XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale

Nov 19, 2021
Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli

Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?

Oct 13, 2021
Xilun Chen, Kushal Lakhotia, Barlas Oğuz, Anchit Gupta, Patrick Lewis, Stan Peshterliev, Yashar Mehdad, Sonal Gupta, Wen-tau Yih

Text-Free Prosody-Aware Generative Spoken Language Modeling

Sep 07, 2021
Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu-Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu

Domain-matched Pre-training Tasks for Dense Retrieval

Jul 28, 2021
Barlas Oğuz, Kushal Lakhotia, Anchit Gupta, Patrick Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Wen-tau Yih, Sonal Gupta, Yashar Mehdad

HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units

Jun 14, 2021
Wei-Ning Hsu, Benjamin Bolte, Yao-Hung Hubert Tsai, Kushal Lakhotia, Ruslan Salakhutdinov, Abdelrahman Mohamed

SUPERB: Speech processing Universal PERformance Benchmark

May 03, 2021
Shu-wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee
