Alert button

"speech": models, code, and papers
Alert button

The Phonetic Footprint of Parkinson's Disease

Dec 21, 2021
Philipp Klumpp, Tomás Arias-Vergara, Juan Camilo Vásquez-Correa, Paula Andrea Pérez-Toro, Juan Rafael Orozco-Arroyave, Anton Batliner, Elmar Nöth

Figure 1 for The Phonetic Footprint of Parkinson's Disease
Figure 2 for The Phonetic Footprint of Parkinson's Disease
Figure 3 for The Phonetic Footprint of Parkinson's Disease
Figure 4 for The Phonetic Footprint of Parkinson's Disease
Viaarxiv icon

AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset

Add code
Bookmark button
Alert button
May 07, 2021
Mohamed Seghir Hadj Ameur, Hassina Aliane

Figure 1 for AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset
Figure 2 for AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset
Figure 3 for AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset
Figure 4 for AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset
Viaarxiv icon

Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech

Mar 03, 2021
Joerg Schmalenstroeer, Jens Heitkaemper, Joerg Ullmann, Reinhold Haeb-Umbach

Figure 1 for Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech
Figure 2 for Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech
Figure 3 for Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech
Figure 4 for Open Range Pitch Tracking for Carrier Frequency Difference Estimation from HF Transmitted Speech
Viaarxiv icon

R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS

Add code
Bookmark button
Alert button
Jun 30, 2022
Kyle Kastner, Aaron Courville

Figure 1 for R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS
Figure 2 for R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS
Figure 3 for R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS
Figure 4 for R-MelNet: Reduced Mel-Spectral Modeling for Neural TTS
Viaarxiv icon

More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations

Add code
Bookmark button
Alert button
Aug 19, 2021
Alessandro Ragano, Emmanouil Benetos, Andrew Hines

Figure 1 for More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations
Figure 2 for More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations
Figure 3 for More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations
Figure 4 for More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations
Viaarxiv icon

Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements

Oct 02, 2020
Arun Das, Jeffrey Mock, Henry Chacon, Farzan Irani, Edward Golob, Peyman Najafirad

Figure 1 for Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements
Figure 2 for Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements
Figure 3 for Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements
Figure 4 for Stuttering Speech Disfluency Prediction using Explainable Attribution Vectors of Facial Muscle Movements
Viaarxiv icon

Dialogue Enhancement and Listening Effort in Broadcast Audio: A Multimodal Evaluation

Aug 03, 2022
Matteo Torcoli, Thomas Robotham, Emanuël A. P. Habets

Figure 1 for Dialogue Enhancement and Listening Effort in Broadcast Audio: A Multimodal Evaluation
Figure 2 for Dialogue Enhancement and Listening Effort in Broadcast Audio: A Multimodal Evaluation
Figure 3 for Dialogue Enhancement and Listening Effort in Broadcast Audio: A Multimodal Evaluation
Viaarxiv icon

Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement

Mar 31, 2020
Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Xiaoli Ma, Chin-Hui Lee

Figure 1 for Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement
Figure 2 for Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement
Figure 3 for Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement
Figure 4 for Characterizing Speech Adversarial Examples Using Self-Attention U-Net Enhancement
Viaarxiv icon

Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation

Add code
Bookmark button
Alert button
Sep 15, 2021
Marco Gaido, Susana Rodríguez, Matteo Negri, Luisa Bentivogli, Marco Turchi

Figure 1 for Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation
Figure 2 for Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation
Figure 3 for Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation
Figure 4 for Is "moby dick" a Whale or a Bird? Named Entities and Terminology in Speech Translation
Viaarxiv icon

Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models

Feb 26, 2022
Samuel Thomas, Brian Kingsbury, George Saon, Hong-Kwang J. Kuo

Figure 1 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 2 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 3 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Figure 4 for Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models
Viaarxiv icon