Alert button

"speech": models, code, and papers
Alert button

Automatic Speech Summarisation: A Scoping Review

Aug 27, 2020
Dana Rezazadegan, Shlomo Berkovsky, Juan C. Quiroz, A. Baki Kocaballi, Ying Wang, Liliana Laranjo, Enrico Coiera

Figure 1 for Automatic Speech Summarisation: A Scoping Review
Figure 2 for Automatic Speech Summarisation: A Scoping Review
Figure 3 for Automatic Speech Summarisation: A Scoping Review
Figure 4 for Automatic Speech Summarisation: A Scoping Review
Viaarxiv icon

ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection

Add code
Bookmark button
Alert button
Mar 17, 2022
Thomas Hartvigsen, Saadia Gabriel, Hamid Palangi, Maarten Sap, Dipankar Ray, Ece Kamar

Figure 1 for ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection
Figure 2 for ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection
Figure 3 for ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection
Figure 4 for ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech Detection
Viaarxiv icon

Learning to Inference with Early Exit in the Progressive Speech Enhancement

Add code
Bookmark button
Alert button
Jun 22, 2021
Andong Li, Chengshi Zheng, Lu Zhang, Xiaodong Li

Figure 1 for Learning to Inference with Early Exit in the Progressive Speech Enhancement
Figure 2 for Learning to Inference with Early Exit in the Progressive Speech Enhancement
Figure 3 for Learning to Inference with Early Exit in the Progressive Speech Enhancement
Viaarxiv icon

Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning

Add code
Bookmark button
Alert button
Aug 23, 2021
Bencheng Wei, Jason Li, Ajay Gupta, Hafiza Umair, Atsu Vovor, Natalie Durzynski

Figure 1 for Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning
Figure 2 for Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning
Figure 3 for Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning
Figure 4 for Offensive Language and Hate Speech Detection with Deep Learning and Transfer Learning
Viaarxiv icon

Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge

Feb 24, 2022
Yen-Ju Lu, Samuele Cornell, Xuankai Chang, Wangyou Zhang, Chenda Li, Zhaoheng Ni, Zhong-Qiu Wang, Shinji Watanabe

Figure 1 for Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge
Figure 2 for Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge
Figure 3 for Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge
Figure 4 for Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge
Viaarxiv icon

V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization

Add code
Bookmark button
Alert button
Oct 27, 2022
Jiangyi Deng, Fei Teng, Yanjiao Chen, Xiaofu Chen, Zhaohui Wang, Wenyuan Xu

Figure 1 for V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization
Figure 2 for V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization
Figure 3 for V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization
Figure 4 for V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization
Viaarxiv icon

Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs

Add code
Bookmark button
Alert button
Oct 27, 2022
Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li

Figure 1 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Figure 2 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Figure 3 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Figure 4 for Self-Supervised Training of Speaker Encoder with Multi-Modal Diverse Positive Pairs
Viaarxiv icon

Towards MOOCs for Lip Reading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale

Aug 21, 2022
Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay Namboodiri, C. V Jawahar

Figure 1 for Towards MOOCs for Lip Reading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale
Figure 2 for Towards MOOCs for Lip Reading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale
Figure 3 for Towards MOOCs for Lip Reading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale
Figure 4 for Towards MOOCs for Lip Reading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale
Viaarxiv icon

VocBench: A Neural Vocoder Benchmark for Speech Synthesis

Add code
Bookmark button
Alert button
Dec 06, 2021
Ehab A. AlBadawy, Andrew Gibiansky, Qing He, Jilong Wu, Ming-Ching Chang, Siwei Lyu

Figure 1 for VocBench: A Neural Vocoder Benchmark for Speech Synthesis
Figure 2 for VocBench: A Neural Vocoder Benchmark for Speech Synthesis
Figure 3 for VocBench: A Neural Vocoder Benchmark for Speech Synthesis
Viaarxiv icon

Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis

Dec 14, 2020
Neeraj Kumar, Srishti Goel, Ankur Narang, Brejesh Lall

Figure 1 for Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
Figure 2 for Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
Figure 3 for Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
Figure 4 for Few Shot Adaptive Normalization Driven Multi-Speaker Speech Synthesis
Viaarxiv icon