Alert button

"speech": models, code, and papers
Alert button

audino: A Modern Annotation Tool for Audio and Speech

Add code
Bookmark button
Alert button
Jun 09, 2020
Manraj Singh Grover, Pakhi Bamdev, Yaman Kumar, Mika Hama, Rajiv Ratn Shah

Figure 1 for audino: A Modern Annotation Tool for Audio and Speech
Figure 2 for audino: A Modern Annotation Tool for Audio and Speech
Viaarxiv icon

Multi-task Language Modeling for Improving Speech Recognition of Rare Words

Nov 25, 2020
Chao-Han Huck Yang, Linda Liu, Ankur Gandhe, Yile Gu, Anirudh Raju, Denis Filimonov, Ivan Bulyko

Figure 1 for Multi-task Language Modeling for Improving Speech Recognition of Rare Words
Figure 2 for Multi-task Language Modeling for Improving Speech Recognition of Rare Words
Figure 3 for Multi-task Language Modeling for Improving Speech Recognition of Rare Words
Figure 4 for Multi-task Language Modeling for Improving Speech Recognition of Rare Words
Viaarxiv icon

Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification

Add code
Bookmark button
Alert button
Apr 12, 2022
Xiaolei Huang

Figure 1 for Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification
Figure 2 for Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification
Figure 3 for Easy Adaptation to Mitigate Gender Bias in Multilingual Text Classification
Viaarxiv icon

Deep Learning Models for Multilingual Hate Speech Detection

Add code
Bookmark button
Alert button
Apr 15, 2020
Sai Saketh Aluru, Binny Mathew, Punyajoy Saha, Animesh Mukherjee

Figure 1 for Deep Learning Models for Multilingual Hate Speech Detection
Figure 2 for Deep Learning Models for Multilingual Hate Speech Detection
Figure 3 for Deep Learning Models for Multilingual Hate Speech Detection
Figure 4 for Deep Learning Models for Multilingual Hate Speech Detection
Viaarxiv icon

Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation

Add code
Bookmark button
Alert button
Apr 09, 2021
Yueyue Na, Ziteng Wang, Zhang Liu, Biao Tian, Qiang Fu

Figure 1 for Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
Figure 2 for Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
Figure 3 for Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
Figure 4 for Joint Online Multichannel Acoustic Echo Cancellation, Speech Dereverberation and Source Separation
Viaarxiv icon

Are discrete units necessary for Spoken Language Modeling?

Add code
Bookmark button
Alert button
Mar 11, 2022
Tu Anh Nguyen, Benoit Sagot, Emmanuel Dupoux

Figure 1 for Are discrete units necessary for Spoken Language Modeling?
Figure 2 for Are discrete units necessary for Spoken Language Modeling?
Figure 3 for Are discrete units necessary for Spoken Language Modeling?
Figure 4 for Are discrete units necessary for Spoken Language Modeling?
Viaarxiv icon

ELITR Non-Native Speech Translation at IWSLT 2020

Add code
Bookmark button
Alert button
Jun 05, 2020
Dominik Macháček, Jonáš Kratochvíl, Sangeet Sagar, Matúš Žilinec, Ondřej Bojar, Thai-Son Nguyen, Felix Schneider, Philip Williams, Yuekun Yao

Figure 1 for ELITR Non-Native Speech Translation at IWSLT 2020
Figure 2 for ELITR Non-Native Speech Translation at IWSLT 2020
Figure 3 for ELITR Non-Native Speech Translation at IWSLT 2020
Figure 4 for ELITR Non-Native Speech Translation at IWSLT 2020
Viaarxiv icon

ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet

Add code
Bookmark button
Alert button
Nov 29, 2021
Siddhant Arora, Siddharth Dalmia, Pavel Denisov, Xuankai Chang, Yushi Ueda, Yifan Peng, Yuekai Zhang, Sujay Kumar, Karthik Ganesan, Brian Yan, Ngoc Thang Vu, Alan W Black, Shinji Watanabe

Figure 1 for ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
Figure 2 for ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
Figure 3 for ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
Figure 4 for ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
Viaarxiv icon

VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation

Add code
Bookmark button
Alert button
Jan 02, 2021
Changhan Wang, Morgane Rivière, Ann Lee, Anne Wu, Chaitanya Talnikar, Daniel Haziza, Mary Williamson, Juan Pino, Emmanuel Dupoux

Figure 1 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 2 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 3 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Figure 4 for VoxPopuli: A Large-Scale Multilingual Speech Corpus for Representation Learning, Semi-Supervised Learning and Interpretation
Viaarxiv icon

A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement

Jun 29, 2022
Vijay Ravi, Jinhan Wang, Jonathan Flint, Abeer Alwan

Figure 1 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Figure 2 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Figure 3 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Figure 4 for A Step Towards Preserving Speakers' Identity While Detecting Depression Via Speaker Disentanglement
Viaarxiv icon