Alert button

"speech": models, code, and papers
Alert button

Neural Architecture Search with Multimodal Fusion Methods for Diagnosing Dementia

Feb 12, 2023
Michail Chatzianastasis, Loukas Ilias, Dimitris Askounis, Michalis Vazirgiannis

Figure 1 for Neural Architecture Search with Multimodal Fusion Methods for Diagnosing Dementia
Figure 2 for Neural Architecture Search with Multimodal Fusion Methods for Diagnosing Dementia
Figure 3 for Neural Architecture Search with Multimodal Fusion Methods for Diagnosing Dementia
Viaarxiv icon

Right the docs: Characterising voice dataset documentation practices used in machine learning

Add code
Bookmark button
Alert button
Mar 19, 2023
Kathy Reid, Elizabeth T. Williams

Figure 1 for Right the docs: Characterising voice dataset documentation practices used in machine learning
Figure 2 for Right the docs: Characterising voice dataset documentation practices used in machine learning
Figure 3 for Right the docs: Characterising voice dataset documentation practices used in machine learning
Figure 4 for Right the docs: Characterising voice dataset documentation practices used in machine learning
Viaarxiv icon

Open Challenges in Synthetic Speech Detection

Sep 15, 2022
Luca Cuccovillo, Christoforos Papastergiopoulos, Anastasios Vafeiadis, Artem Yaroshchuk, Patrick Aichroth, Konstantinos Votis, Dimitrios Tzovaras

Figure 1 for Open Challenges in Synthetic Speech Detection
Viaarxiv icon

Residual Information in Deep Speaker Embedding Architectures

Add code
Bookmark button
Alert button
Feb 06, 2023
Adriana Stan

Figure 1 for Residual Information in Deep Speaker Embedding Architectures
Figure 2 for Residual Information in Deep Speaker Embedding Architectures
Figure 3 for Residual Information in Deep Speaker Embedding Architectures
Figure 4 for Residual Information in Deep Speaker Embedding Architectures
Viaarxiv icon

Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels

Add code
Bookmark button
Alert button
Mar 30, 2023
Tzeviya Sylvia Fuchs, Yedid Hoshen

Figure 1 for Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels
Figure 2 for Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels
Figure 3 for Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels
Figure 4 for Unsupervised Word Segmentation Using Temporal Gradient Pseudo-Labels
Viaarxiv icon

Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis

Add code
Bookmark button
Alert button
Sep 14, 2022
Yukun Peng, Zhenhua Ling

Figure 1 for Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Figure 2 for Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Figure 3 for Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Figure 4 for Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Viaarxiv icon

NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling

Add code
Bookmark button
Alert button
Jun 18, 2022
Chi-Chang Lee, Cheng-Hung Hu, Yu-Chen Lin, Chu-Song Chen, Hsin-Min Wang, Yu Tsao

Figure 1 for NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Figure 2 for NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Figure 3 for NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Figure 4 for NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Viaarxiv icon

TaylorAECNet: A Taylor Style Neural Network for Full-Band Echo Cancellation

Mar 11, 2023
Weiming Xu, Zhihao Guo

Figure 1 for TaylorAECNet: A Taylor Style Neural Network for Full-Band Echo Cancellation
Figure 2 for TaylorAECNet: A Taylor Style Neural Network for Full-Band Echo Cancellation
Figure 3 for TaylorAECNet: A Taylor Style Neural Network for Full-Band Echo Cancellation
Figure 4 for TaylorAECNet: A Taylor Style Neural Network for Full-Band Echo Cancellation
Viaarxiv icon

BERT-based Ensemble Approaches for Hate Speech Detection

Sep 15, 2022
Khouloud Mnassri, Praboda Rajapaksha, Reza Farahbakhsh, Noel Crespi

Figure 1 for BERT-based Ensemble Approaches for Hate Speech Detection
Figure 2 for BERT-based Ensemble Approaches for Hate Speech Detection
Figure 3 for BERT-based Ensemble Approaches for Hate Speech Detection
Figure 4 for BERT-based Ensemble Approaches for Hate Speech Detection
Viaarxiv icon

Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents

Apr 03, 2022
Priyank Dubey, Bilal Shah

Viaarxiv icon