Alessandro Ragano

Dialogue Understandability: Why are we streaming movies with subtitles?

Mar 22, 2024

NOMAD: Unsupervised Learning of Perceptual Embeddings for Speech Enhancement and Non-matching Reference Audio Quality Assessment

Sep 28, 2023

Reduce, Reuse, Recycle: Is Perturbed Data better than Other Language augmentation for Low Resource Self-Supervised Speech Models

Sep 22, 2023

Learning Music Representations with wav2vec 2.0

Oct 27, 2022

Using Rater and System Metadata to Explain Variance in the VoiceMOS Challenge 2022 Dataset

Sep 14, 2022

A Comparison of Deep Learning MOS Predictors for Speech Synthesis Quality

Apr 05, 2022

Exploring the influence of fine-tuning data on wav2vec 2.0 model for blind speech quality prediction

Apr 05, 2022

AQP: An Open Modular Python Platform for Objective Speech and Audio Quality Metrics

Oct 26, 2021

More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations

Aug 19, 2021

Audio Impairment Recognition Using a Correlation-Based Feature Representation

Mar 24, 2020