Picture for Erica Cooper

Erica Cooper

Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction

Add code
Dec 25, 2023
Figure 1 for Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction
Figure 2 for Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction
Figure 3 for Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction
Figure 4 for Uncertainty as a Predictor: Leveraging Self-Supervised Learning for Zero-Shot MOS Prediction
Viaarxiv icon

ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations

Add code
Dec 22, 2023
Figure 1 for ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Figure 2 for ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Figure 3 for ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Figure 4 for ZMM-TTS: Zero-shot Multilingual and Multispeaker Speech Synthesis Conditioned on Self-supervised Discrete Speech Representations
Viaarxiv icon

Speaker-Text Retrieval via Contrastive Learning

Add code
Dec 11, 2023
Viaarxiv icon

Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting

Add code
Oct 08, 2023
Figure 1 for Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Figure 2 for Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Figure 3 for Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Figure 4 for Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Viaarxiv icon

The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains

Add code
Oct 07, 2023
Figure 1 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 2 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 3 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 4 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Viaarxiv icon

DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input

Add code
Sep 14, 2023
Viaarxiv icon

SynVox2: Towards a privacy-friendly VoxCeleb2 dataset

Add code
Sep 12, 2023
Figure 1 for SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
Figure 2 for SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
Figure 3 for SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
Figure 4 for SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
Viaarxiv icon

Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music

Add code
Jun 15, 2023
Viaarxiv icon

Language-independent speaker anonymization using orthogonal Householder neural network

Add code
May 30, 2023
Figure 1 for Language-independent speaker anonymization using orthogonal Householder neural network
Figure 2 for Language-independent speaker anonymization using orthogonal Householder neural network
Figure 3 for Language-independent speaker anonymization using orthogonal Householder neural network
Figure 4 for Language-independent speaker anonymization using orthogonal Householder neural network
Viaarxiv icon

Range-Based Equal Error Rate for Spoof Localization

Add code
May 28, 2023
Figure 1 for Range-Based Equal Error Rate for Spoof Localization
Figure 2 for Range-Based Equal Error Rate for Spoof Localization
Figure 3 for Range-Based Equal Error Rate for Spoof Localization
Figure 4 for Range-Based Equal Error Rate for Spoof Localization
Viaarxiv icon