Picture for Junichi Yamagishi

Junichi Yamagishi

LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech

Add code
Oct 18, 2021
Figure 1 for LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech
Figure 2 for LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech
Figure 3 for LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech
Viaarxiv icon

Generalization Ability of MOS Prediction Networks

Add code
Oct 18, 2021
Figure 1 for Generalization Ability of MOS Prediction Networks
Figure 2 for Generalization Ability of MOS Prediction Networks
Figure 3 for Generalization Ability of MOS Prediction Networks
Figure 4 for Generalization Ability of MOS Prediction Networks
Viaarxiv icon

Revisiting Speech Content Privacy

Add code
Oct 13, 2021
Figure 1 for Revisiting Speech Content Privacy
Figure 2 for Revisiting Speech Content Privacy
Figure 3 for Revisiting Speech Content Privacy
Viaarxiv icon

LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example

Add code
Oct 11, 2021
Figure 1 for LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example
Figure 2 for LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example
Figure 3 for LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example
Figure 4 for LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example
Viaarxiv icon

Estimating the confidence of speech spoofing countermeasure

Add code
Oct 10, 2021
Figure 1 for Estimating the confidence of speech spoofing countermeasure
Figure 2 for Estimating the confidence of speech spoofing countermeasure
Figure 3 for Estimating the confidence of speech spoofing countermeasure
Figure 4 for Estimating the confidence of speech spoofing countermeasure
Viaarxiv icon

On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis

Add code
Oct 04, 2021
Figure 1 for On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis
Figure 2 for On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis
Figure 3 for On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis
Figure 4 for On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis
Viaarxiv icon

DDS: A new device-degraded speech dataset for speech enhancement

Add code
Sep 28, 2021
Figure 1 for DDS: A new device-degraded speech dataset for speech enhancement
Figure 2 for DDS: A new device-degraded speech dataset for speech enhancement
Figure 3 for DDS: A new device-degraded speech dataset for speech enhancement
Figure 4 for DDS: A new device-degraded speech dataset for speech enhancement
Viaarxiv icon

Master Face Attacks on Face Recognition Systems

Add code
Sep 08, 2021
Figure 1 for Master Face Attacks on Face Recognition Systems
Figure 2 for Master Face Attacks on Face Recognition Systems
Figure 3 for Master Face Attacks on Face Recognition Systems
Figure 4 for Master Face Attacks on Face Recognition Systems
Viaarxiv icon

The VoicePrivacy 2020 Challenge: Results and findings

Add code
Sep 01, 2021
Figure 1 for The VoicePrivacy 2020 Challenge: Results and findings
Figure 2 for The VoicePrivacy 2020 Challenge: Results and findings
Figure 3 for The VoicePrivacy 2020 Challenge: Results and findings
Figure 4 for The VoicePrivacy 2020 Challenge: Results and findings
Viaarxiv icon

ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection

Add code
Sep 01, 2021
Figure 1 for ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Figure 2 for ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Figure 3 for ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Figure 4 for ASVspoof 2021: accelerating progress in spoofed and deepfake speech detection
Viaarxiv icon