"speech": models, code, and papers

Remap, warp and attend: Non-parallel many-to-many accent conversion with Normalizing Flows

Nov 10, 2022
Abdelhamid Ezzerg, Thomas Merritt, Kayoko Yanagisawa, Piotr Bilinski, Magdalena Proszewska, Kamil Pokora, Renard Korzeniowski, Roberto Barra-Chicote, Daniel Korzekwa

mSLAM: Massively multilingual joint pre-training for speech and text

Feb 03, 2022
Ankur Bapna, Colin Cherry, Yu Zhang, Ye Jia, Melvin Johnson, Yong Cheng, Simran Khanuja, Jason Riesa, Alexis Conneau

A$^3$T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing

Mar 18, 2022
He Bai, Renjie Zheng, Junkun Chen, Xintong Li, Mingbo Ma, Liang Huang

Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation

Apr 14, 2022
William Ravenscroft, Stefan Goetze, Thomas Hain

Predicting non-native speech perception using the Perceptual Assimilation Model and state-of-the-art acoustic models

May 31, 2022
Juliette Millet, Ioana Chitoran, Ewan Dunbar

Highly Generalizable Models for Multilingual Hate Speech Detection

Jan 27, 2022
Neha Deshpande, Nicholas Farris, Vidhur Kumar

Deep Learning Approach for Classifying the Aggressive Comments on Social Media: Machine Translated Data Vs Real Life Data

Mar 13, 2023
Mst Shapna Akter, Hossain Shahriar, Nova Ahmed, Alfredo Cuzzocrea

Multilingual Content Moderation: A Case Study on Reddit

Feb 19, 2023
Meng Ye, Karan Sikka, Katherine Atwell, Sabit Hassan, Ajay Divakaran, Malihe Alikhani

DDS: A new device-degraded speech dataset for speech enhancement

Sep 21, 2021
Haoyu Li, Junichi Yamagishi
