Alert button

"speech": models, code, and papers
Alert button

SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition

Oct 08, 2021
Li Fu, Xiaoxiao Li, Runyu Wang, Zhengchen Zhang, Youzheng Wu, Xiaodong He, Bowen Zhou

Figure 1 for SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition
Figure 2 for SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition
Figure 3 for SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition
Figure 4 for SCaLa: Supervised Contrastive Learning for End-to-End Automatic Speech Recognition
Viaarxiv icon

VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature

Add code
Bookmark button
Alert button
Apr 02, 2022
Chenpeng Du, Yiwei Guo, Xie Chen, Kai Yu

Figure 1 for VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature
Figure 2 for VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature
Figure 3 for VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature
Figure 4 for VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature
Viaarxiv icon

Lifelong Learning of Hate Speech Classification on Social Media

Jun 05, 2021
Jing Qian, Hong Wang, Mai ElSherief, Xifeng Yan

Figure 1 for Lifelong Learning of Hate Speech Classification on Social Media
Figure 2 for Lifelong Learning of Hate Speech Classification on Social Media
Figure 3 for Lifelong Learning of Hate Speech Classification on Social Media
Figure 4 for Lifelong Learning of Hate Speech Classification on Social Media
Viaarxiv icon

Deep generative factorization for speech signal

Oct 27, 2020
Haoran Sun, Lantian Li, Yunqi Cai, Yang Zhang, Thomas Fang Zheng, Dong Wang

Figure 1 for Deep generative factorization for speech signal
Figure 2 for Deep generative factorization for speech signal
Figure 3 for Deep generative factorization for speech signal
Figure 4 for Deep generative factorization for speech signal
Viaarxiv icon

Multichannel Speech Enhancement without Beamforming

Add code
Bookmark button
Alert button
Oct 25, 2021
Asutosh Pandey, Buye Xu, Anurag Kumar, Jacob Donley, Paul Calamia, DeLiang Wang

Figure 1 for Multichannel Speech Enhancement without Beamforming
Figure 2 for Multichannel Speech Enhancement without Beamforming
Figure 3 for Multichannel Speech Enhancement without Beamforming
Figure 4 for Multichannel Speech Enhancement without Beamforming
Viaarxiv icon

Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval

Apr 08, 2021
Ramon Sanabria, Austin Waters, Jason Baldridge

Figure 1 for Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval
Figure 2 for Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval
Figure 3 for Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval
Figure 4 for Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval
Viaarxiv icon

Weakly-supervised word-level pronunciation error detection in non-native English speech

Jun 07, 2021
Daniel Korzekwa, Jaime Lorenzo-Trueba, Thomas Drugman, Shira Calamaro, Bozena Kostek

Figure 1 for Weakly-supervised word-level pronunciation error detection in non-native English speech
Figure 2 for Weakly-supervised word-level pronunciation error detection in non-native English speech
Figure 3 for Weakly-supervised word-level pronunciation error detection in non-native English speech
Figure 4 for Weakly-supervised word-level pronunciation error detection in non-native English speech
Viaarxiv icon

Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish

Add code
Bookmark button
Alert button
May 31, 2022
Alp Öktem, Rodolfo Zevallos, Yasmin Moslem, Güneş Öztürk, Karen Şarhon

Figure 1 for Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish
Figure 2 for Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish
Figure 3 for Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish
Figure 4 for Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish
Viaarxiv icon

Multilingual Hate Speech and Offensive Content Detection using Modified Cross-entropy Loss

Add code
Bookmark button
Alert button
Feb 05, 2022
Arka Mitra, Priyanshu Sankhala

Figure 1 for Multilingual Hate Speech and Offensive Content Detection using Modified Cross-entropy Loss
Figure 2 for Multilingual Hate Speech and Offensive Content Detection using Modified Cross-entropy Loss
Figure 3 for Multilingual Hate Speech and Offensive Content Detection using Modified Cross-entropy Loss
Viaarxiv icon

Read it to me: An emotionally aware Speech Narration Application

Add code
Bookmark button
Alert button
Sep 06, 2022
Rishibha Bansal

Figure 1 for Read it to me: An emotionally aware Speech Narration Application
Figure 2 for Read it to me: An emotionally aware Speech Narration Application
Figure 3 for Read it to me: An emotionally aware Speech Narration Application
Figure 4 for Read it to me: An emotionally aware Speech Narration Application
Viaarxiv icon