Picture for Erica Cooper

Erica Cooper

Investigating Range-Equalizing Bias in Mean Opinion Score Ratings of Synthesized Speech

Add code
May 22, 2023
Viaarxiv icon

Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms

Add code
May 18, 2023
Viaarxiv icon

Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?

Add code
Nov 25, 2022
Figure 1 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 2 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 3 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 4 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Viaarxiv icon

The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance

Add code
Apr 28, 2022
Figure 1 for The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance
Figure 2 for The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance
Figure 3 for The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance
Figure 4 for The PartialSpoof Database and Countermeasures for the Detection of Short Generated Audio Segments Embedded in a Speech Utterance
Viaarxiv icon

The VoiceMOS Challenge 2022

Add code
Mar 28, 2022
Figure 1 for The VoiceMOS Challenge 2022
Figure 2 for The VoiceMOS Challenge 2022
Figure 3 for The VoiceMOS Challenge 2022
Figure 4 for The VoiceMOS Challenge 2022
Viaarxiv icon

Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models

Add code
Mar 26, 2022
Figure 1 for Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models
Figure 2 for Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models
Figure 3 for Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models
Figure 4 for Language-Independent Speaker Anonymization Approach using Self-Supervised Pre-Trained Models
Viaarxiv icon

LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech

Add code
Oct 18, 2021
Figure 1 for LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech
Figure 2 for LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech
Figure 3 for LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech
Viaarxiv icon

Generalization Ability of MOS Prediction Networks

Add code
Oct 18, 2021
Figure 1 for Generalization Ability of MOS Prediction Networks
Figure 2 for Generalization Ability of MOS Prediction Networks
Figure 3 for Generalization Ability of MOS Prediction Networks
Figure 4 for Generalization Ability of MOS Prediction Networks
Viaarxiv icon

On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis

Add code
Oct 04, 2021
Figure 1 for On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis
Figure 2 for On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis
Figure 3 for On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis
Figure 4 for On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis
Viaarxiv icon

Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection

Add code
Jul 29, 2021
Figure 1 for Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection
Figure 2 for Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection
Figure 3 for Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection
Figure 4 for Multi-Task Learning in Utterance-Level and Segmental-Level Spoof Detection
Viaarxiv icon