Alert button

"speech": models, code, and papers
Alert button

Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

Oct 12, 2022
Shuhao Deng, Chengfei Li, infeng Bai, Qingqing Zhang, Wei-Qiang Zhang, Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

Figure 1 for Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge
Viaarxiv icon

Addressing the Challenges of Cross-Lingual Hate Speech Detection

Jan 15, 2022
Irina Bigoulaeva, Viktor Hangya, Iryna Gurevych, Alexander Fraser

Figure 1 for Addressing the Challenges of Cross-Lingual Hate Speech Detection
Figure 2 for Addressing the Challenges of Cross-Lingual Hate Speech Detection
Figure 3 for Addressing the Challenges of Cross-Lingual Hate Speech Detection
Figure 4 for Addressing the Challenges of Cross-Lingual Hate Speech Detection
Viaarxiv icon

MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient

Mar 16, 2022
Andong Li, Chengshi Zheng, Ziyang Zhang, Xiaodong Li

Figure 1 for MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Figure 2 for MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Figure 3 for MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Figure 4 for MDNet: Learning Monaural Speech Enhancement from Deep Prior Gradient
Viaarxiv icon

Conversation-oriented ASR with multi-look-ahead CBS architecture

Nov 02, 2022
Huaibo Zhao, Shinya Fujie, Tetsuji Ogawa, Jin Sakuma, Yusuke Kida, Tetsunori Kobayashi

Figure 1 for Conversation-oriented ASR with multi-look-ahead CBS architecture
Figure 2 for Conversation-oriented ASR with multi-look-ahead CBS architecture
Figure 3 for Conversation-oriented ASR with multi-look-ahead CBS architecture
Viaarxiv icon

Modernizing Open-Set Speech Language Identification

May 20, 2022
Mustafa Eyceoz, Justin Lee, Homayoon Beigi

Figure 1 for Modernizing Open-Set Speech Language Identification
Figure 2 for Modernizing Open-Set Speech Language Identification
Figure 3 for Modernizing Open-Set Speech Language Identification
Figure 4 for Modernizing Open-Set Speech Language Identification
Viaarxiv icon

Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling

Mar 15, 2022
Tiantian Feng, Shrikanth Narayanan

Figure 1 for Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling
Figure 2 for Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling
Figure 3 for Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling
Viaarxiv icon

Low Latency Time Domain Multichannel Speech and Music Source Separation

Apr 12, 2022
Gerald Schuller

Figure 1 for Low Latency Time Domain Multichannel Speech and Music Source Separation
Figure 2 for Low Latency Time Domain Multichannel Speech and Music Source Separation
Figure 3 for Low Latency Time Domain Multichannel Speech and Music Source Separation
Viaarxiv icon

Adaptive multilingual speech recognition with pretrained models

May 24, 2022
Ngoc-Quan Pham, Alex Waibel, Jan Niehues

Figure 1 for Adaptive multilingual speech recognition with pretrained models
Viaarxiv icon

Filter-based Discriminative Autoencoders for Children Speech Recognition

Apr 01, 2022
Chiang-Lin Tai, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang

Figure 1 for Filter-based Discriminative Autoencoders for Children Speech Recognition
Figure 2 for Filter-based Discriminative Autoencoders for Children Speech Recognition
Figure 3 for Filter-based Discriminative Autoencoders for Children Speech Recognition
Figure 4 for Filter-based Discriminative Autoencoders for Children Speech Recognition
Viaarxiv icon

NusaCrowd: Open Source Initiative for Indonesian NLP Resources

Dec 20, 2022
Samuel Cahyawijaya, Holy Lovenia, Alham Fikri Aji, Genta Indra Winata, Bryan Wilie, Rahmad Mahendra, Christian Wibisono, Ade Romadhony, Karissa Vincentio, Fajri Koto, Jennifer Santoso, David Moeljadi, Cahya Wirawan, Frederikus Hudi, Ivan Halim Parmonangan, Ika Alfina, Muhammad Satrio Wicaksono, Ilham Firdausi Putra, Samsul Rahmadani, Yulianti Oenang, Ali Akbar Septiandri, James Jaya, Kaustubh D. Dhole, Arie Ardiyanti Suryani, Rifki Afina Putri, Dan Su, Keith Stevens, Made Nindyatama Nityasya, Muhammad Farid Adilazuarda, Ryan Ignatius, Ryandito Diandaru, Tiezheng Yu, Vito Ghifari, Wenliang Dai, Yan Xu, Dyah Damapuspita, Cuk Tho, Ichwanul Muslim Karo Karo, Tirana Noor Fatyanosa, Ziwei Ji, Pascale Fung, Graham Neubig, Timothy Baldwin, Sebastian Ruder, Herry Sujaini, Sakriani Sakti, Ayu Purwarianti

Figure 1 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Figure 2 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Figure 3 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Figure 4 for NusaCrowd: Open Source Initiative for Indonesian NLP Resources
Viaarxiv icon