Alert button

"speech": models, code, and papers
Alert button

Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning

Add code
Bookmark button
Alert button
Aug 23, 2021
Bencheng Wei, Jason Li, Ajay Gupta, Hafiza Umair, Atsu Vovor, Natalie Durzynski

Figure 1 for Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning
Figure 2 for Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning
Figure 3 for Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning
Figure 4 for Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning
Viaarxiv icon

Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors

Feb 27, 2021
Manuel Sam Ribeiro, Joanne Cleland, Aciel Eshky, Korin Richmond, Steve Renals

Figure 1 for Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors
Figure 2 for Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors
Figure 3 for Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors
Figure 4 for Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors
Viaarxiv icon

A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music

Add code
Bookmark button
Alert button
Mar 04, 2021
Hanbin Bae, Jae-Sung Bae, Young-Sun Joo, Young-Ik Kim, Hoon-Young Cho

Figure 1 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Figure 2 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Figure 3 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Figure 4 for A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music
Viaarxiv icon

A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline

Add code
Bookmark button
Alert button
Sep 22, 2020
Yerbolat Khassanov, Saida Mussakhojayeva, Almas Mirzakhmetov, Alen Adiyev, Mukhamet Nurpeiissov, Huseyin Atakan Varol

Figure 1 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 2 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 3 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Figure 4 for A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline
Viaarxiv icon

Learning to Inference with Early Exit in the Progressive Speech Enhancement

Add code
Bookmark button
Alert button
Jun 22, 2021
Andong Li, Chengshi Zheng, Lu Zhang, Xiaodong Li

Figure 1 for Learning to Inference with Early Exit in the Progressive Speech Enhancement
Figure 2 for Learning to Inference with Early Exit in the Progressive Speech Enhancement
Figure 3 for Learning to Inference with Early Exit in the Progressive Speech Enhancement
Viaarxiv icon

Context-Aware Prosody Correction for Text-Based Speech Editing

Feb 16, 2021
Max Morrison, Lucas Rencker, Zeyu Jin, Nicholas J. Bryan, Juan-Pablo Caceres, Bryan Pardo

Figure 1 for Context-Aware Prosody Correction for Text-Based Speech Editing
Figure 2 for Context-Aware Prosody Correction for Text-Based Speech Editing
Figure 3 for Context-Aware Prosody Correction for Text-Based Speech Editing
Viaarxiv icon

TADRN: Triple-Attentive Dual-Recurrent Network for Ad-hoc Array Multichannel Speech Enhancement

Add code
Bookmark button
Alert button
Oct 22, 2021
Ashutosh Pandey, Buye Xu, Anurag Kumar, Jacob Donley, Paul Calamia, DeLiang Wang

Figure 1 for TADRN: Triple-Attentive Dual-Recurrent Network for Ad-hoc Array Multichannel Speech Enhancement
Figure 2 for TADRN: Triple-Attentive Dual-Recurrent Network for Ad-hoc Array Multichannel Speech Enhancement
Figure 3 for TADRN: Triple-Attentive Dual-Recurrent Network for Ad-hoc Array Multichannel Speech Enhancement
Figure 4 for TADRN: Triple-Attentive Dual-Recurrent Network for Ad-hoc Array Multichannel Speech Enhancement
Viaarxiv icon

Longitudinal Sentiment Analyses for Radicalization Research: Intertemporal Dynamics on Social Media Platforms and their Implications

Oct 01, 2022
Dennis Klinkhammer

Viaarxiv icon

Adversarial synthesis based data-augmentation for code-switched spoken language identification

May 30, 2022
Parth Shastri, Chirag Patil, Poorval Wanere, Dr. Shrinivas Mahajan, Dr. Abhishek Bhatt, Dr. Hardik Sailor

Figure 1 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Figure 2 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Figure 3 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Figure 4 for Adversarial synthesis based data-augmentation for code-switched spoken language identification
Viaarxiv icon

Embedding and Beamforming: All-neural Causal Beamformer for Multichannel Speech Enhancement

Add code
Bookmark button
Alert button
Sep 02, 2021
Andong Li, Wenzhe Liu, Chengshi Zheng, Xiaodong Li

Figure 1 for Embedding and Beamforming: All-neural Causal Beamformer for Multichannel Speech Enhancement
Figure 2 for Embedding and Beamforming: All-neural Causal Beamformer for Multichannel Speech Enhancement
Viaarxiv icon