Alert button

"speech": models, code, and papers
Alert button

Speaker Identification Experiments Under Gender De-Identification

Mar 09, 2022
Marcos Faundez-Zanuy, Enric Sesa-Nogueras, Stefano Marinozzi

Figure 1 for Speaker Identification Experiments Under Gender De-Identification
Figure 2 for Speaker Identification Experiments Under Gender De-Identification
Figure 3 for Speaker Identification Experiments Under Gender De-Identification
Figure 4 for Speaker Identification Experiments Under Gender De-Identification
Viaarxiv icon

Convolutive Prediction for Reverberant Speech Separation

Aug 16, 2021
Zhong-Qiu Wang, Gordon Wichern, Jonathan Le Roux

Figure 1 for Convolutive Prediction for Reverberant Speech Separation
Figure 2 for Convolutive Prediction for Reverberant Speech Separation
Figure 3 for Convolutive Prediction for Reverberant Speech Separation
Viaarxiv icon

Cross-lingual Dysarthria Severity Classification for English, Korean, and Tamil

Add code
Bookmark button
Alert button
Sep 26, 2022
Eun Jung Yeo, Kwanghee Choi, Sunhee Kim, Minhwa Chung

Figure 1 for Cross-lingual Dysarthria Severity Classification for English, Korean, and Tamil
Figure 2 for Cross-lingual Dysarthria Severity Classification for English, Korean, and Tamil
Figure 3 for Cross-lingual Dysarthria Severity Classification for English, Korean, and Tamil
Figure 4 for Cross-lingual Dysarthria Severity Classification for English, Korean, and Tamil
Viaarxiv icon

Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings

Aug 20, 2022
Yile Wang, Yue Zhang

Figure 1 for Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Figure 2 for Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Figure 3 for Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Figure 4 for Lost in Context? On the Sense-wise Variance of Contextualized Word Embeddings
Viaarxiv icon

Speech Synthesis using EEG

Feb 22, 2020
Gautam Krishna, Co Tran, Yan Han, Mason Carnahan

Figure 1 for Speech Synthesis using EEG
Figure 2 for Speech Synthesis using EEG
Figure 3 for Speech Synthesis using EEG
Figure 4 for Speech Synthesis using EEG
Viaarxiv icon

Deep Residual Echo Suppression and Noise Reduction: A Multi-Input FCRN Approach in a Hybrid Speech Enhancement System

Aug 06, 2021
Jan Franzen, Tim Fingscheidt

Figure 1 for Deep Residual Echo Suppression and Noise Reduction: A Multi-Input FCRN Approach in a Hybrid Speech Enhancement System
Figure 2 for Deep Residual Echo Suppression and Noise Reduction: A Multi-Input FCRN Approach in a Hybrid Speech Enhancement System
Figure 3 for Deep Residual Echo Suppression and Noise Reduction: A Multi-Input FCRN Approach in a Hybrid Speech Enhancement System
Viaarxiv icon

End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning

Jul 07, 2021
Tomohiro Tanaka, Ryo Masumura, Mana Ihori, Akihiko Takashima, Shota Orihashi, Naoki Makishima

Figure 1 for End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning
Figure 2 for End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning
Figure 3 for End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning
Figure 4 for End-to-End Rich Transcription-Style Automatic Speech Recognition with Semi-Supervised Learning
Viaarxiv icon

Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments

Aug 27, 2022
Muskan Garg, Naveen Aggarwal

Figure 1 for Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments
Figure 2 for Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments
Figure 3 for Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments
Figure 4 for Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments
Viaarxiv icon

UPC's Speech Translation System for IWSLT 2021

Add code
Bookmark button
Alert button
May 10, 2021
Gerard I. Gállego, Ioannis Tsiamas, Carlos Escolano, José A. R. Fonollosa, Marta R. Costa-jussà

Figure 1 for UPC's Speech Translation System for IWSLT 2021
Figure 2 for UPC's Speech Translation System for IWSLT 2021
Figure 3 for UPC's Speech Translation System for IWSLT 2021
Figure 4 for UPC's Speech Translation System for IWSLT 2021
Viaarxiv icon

An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production

Mar 09, 2022
Anwesha Roy, Varun Belagali, Prasanta Kumar Ghosh

Figure 1 for An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production
Figure 2 for An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production
Figure 3 for An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production
Figure 4 for An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production
Viaarxiv icon