"speech": models, code, and papers

Advancing Stuttering Detection via Data Augmentation, Class-Balanced Loss and Multi-Contextual Deep Learning

Feb 21, 2023
Shakeel A. Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni

Via arXiv

Transferring Knowledge via Neighborhood-Aware Optimal Transport for Low-Resource Hate Speech Detection

Oct 17, 2022
Tulika Bose, Irina Illina, Dominique Fohr

Via arXiv

Real-time Speech Interruption Analysis: From Cloud to Client Deployment

Oct 24, 2022
Quchen Fu, Szu-Wei Fu, Yaran Fan, Yu Wu, Zhuo Chen, Jayant Gupchup, Ross Cutler

Via arXiv

Monolingual Recognizers Fusion for Code-switching Speech Recognition

Nov 02, 2022
Tongtong Song, Qiang Xu, Haoyu Lu, Longbiao Wang, Hao Shi, Yuqin Lin, Yanbing Yang, Jianwu Dang

Via arXiv

Small-footprint slimmable networks for keyword spotting

Apr 21, 2023
Zuhaib Akhtar, Mohammad Omar Khursheed, Dongsu Du, Yuzong Liu

Via arXiv

An ASR-free Fluency Scoring Approach with Self-Supervised Learning

Feb 20, 2023
Wei Liu, Kaiqi Fu, Xiaohai Tian, Shuju Shi, Wei Li, Zejun Ma, Tan Lee

Via arXiv

Assessing ASR Model Quality on Disordered Speech using BERTScore

Sep 21, 2022
Jimmy Tobin, Qisheng Li, Subhashini Venugopalan, Katie Seaver, Richard Cave, Katrin Tomanek

Via arXiv

PhaseAug: A Differentiable Augmentation for Speech Synthesis to Simulate One-to-Many Mapping

Nov 08, 2022
Junhyeok Lee, Seungu Han, Hyunjae Cho, Wonbin Jung

Via arXiv

Evaluating gesture-generation in a large-scale open challenge: The GENEA Challenge 2022

Mar 15, 2023
Taras Kucherenko, Pieter Wolfert, Youngwoo Yoon, Carla Viegas, Teodor Nikolov, Mihail Tsakov, Gustav Eje Henter

Via arXiv

Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences

Mar 15, 2023
Yuan Tseng, Cheng-I Lai, Hung-yi Lee

Via arXiv