Alert button

"speech": models, code, and papers
Alert button

Pathological voice adaptation with autoencoder-based voice conversion

Add code
Bookmark button
Alert button
Jun 15, 2021
Marc Illa, Bence Mark Halpern, Rob van Son, Laureano Moro-Velazquez, Odette Scharenborg

Figure 1 for Pathological voice adaptation with autoencoder-based voice conversion
Figure 2 for Pathological voice adaptation with autoencoder-based voice conversion
Figure 3 for Pathological voice adaptation with autoencoder-based voice conversion
Figure 4 for Pathological voice adaptation with autoencoder-based voice conversion
Viaarxiv icon

Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework

Oct 29, 2020
Dhruv Guliani, Francoise Beaufays, Giovanni Motta

Figure 1 for Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework
Figure 2 for Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework
Figure 3 for Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework
Figure 4 for Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework
Viaarxiv icon

TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation

Add code
Bookmark button
Alert button
Sep 21, 2018
Yi Luo, Nima Mesgarani

Figure 1 for TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation
Figure 2 for TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation
Figure 3 for TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation
Figure 4 for TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation
Viaarxiv icon

Gradient-Adjusted Neuron Activation Profiles for Comprehensive Introspection of Convolutional Speech Recognition Models

Feb 19, 2020
Andreas Krug, Sebastian Stober

Figure 1 for Gradient-Adjusted Neuron Activation Profiles for Comprehensive Introspection of Convolutional Speech Recognition Models
Figure 2 for Gradient-Adjusted Neuron Activation Profiles for Comprehensive Introspection of Convolutional Speech Recognition Models
Figure 3 for Gradient-Adjusted Neuron Activation Profiles for Comprehensive Introspection of Convolutional Speech Recognition Models
Figure 4 for Gradient-Adjusted Neuron Activation Profiles for Comprehensive Introspection of Convolutional Speech Recognition Models
Viaarxiv icon

Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors

Add code
Bookmark button
Alert button
Apr 05, 2022
Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko

Figure 1 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Figure 2 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Figure 3 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Figure 4 for Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation Vectors
Viaarxiv icon

Phase-aware Speech Enhancement with Deep Complex U-Net

Add code
Bookmark button
Alert button
Apr 02, 2019
Hyeong-Seok Choi, Jang-Hyun Kim, Jaesung Huh, Adrian Kim, Jung-Woo Ha, Kyogu Lee

Figure 1 for Phase-aware Speech Enhancement with Deep Complex U-Net
Figure 2 for Phase-aware Speech Enhancement with Deep Complex U-Net
Figure 3 for Phase-aware Speech Enhancement with Deep Complex U-Net
Figure 4 for Phase-aware Speech Enhancement with Deep Complex U-Net
Viaarxiv icon

GraphTTS: graph-to-sequence modelling in neural text-to-speech

Mar 04, 2020
Aolan Sun, Jianzong Wang, Ning Cheng, Huayi Peng, Zhen Zeng, Jing Xiao

Figure 1 for GraphTTS: graph-to-sequence modelling in neural text-to-speech
Figure 2 for GraphTTS: graph-to-sequence modelling in neural text-to-speech
Figure 3 for GraphTTS: graph-to-sequence modelling in neural text-to-speech
Figure 4 for GraphTTS: graph-to-sequence modelling in neural text-to-speech
Viaarxiv icon

BUT Opensat 2019 Speech Recognition System

Jan 30, 2020
Martin Karafiát, Murali Karthick Baskar, Igor Szöke, Hari Krishna Vydana, Karel Veselý, Jan "Honza'' Černocký

Figure 1 for BUT Opensat 2019 Speech Recognition System
Figure 2 for BUT Opensat 2019 Speech Recognition System
Figure 3 for BUT Opensat 2019 Speech Recognition System
Figure 4 for BUT Opensat 2019 Speech Recognition System
Viaarxiv icon

Noisy-to-Noisy Voice Conversion Framework with Denoising Model

Sep 22, 2021
Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda

Figure 1 for Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Figure 2 for Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Figure 3 for Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Figure 4 for Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Viaarxiv icon

Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition

Jun 05, 2019
Pingchuan Ma, Stavros Petridis, Maja Pantic

Figure 1 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 2 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 3 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Figure 4 for Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition
Viaarxiv icon