Picture for Junichi Yamagishi

Junichi Yamagishi

Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting

Add code
Oct 08, 2023
Figure 1 for Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Figure 2 for Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Figure 3 for Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Figure 4 for Partial Rank Similarity Minimization Method for Quality MOS Prediction of Unseen Speech Synthesis Systems in Zero-Shot and Semi-supervised setting
Viaarxiv icon

The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains

Add code
Oct 07, 2023
Figure 1 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 2 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 3 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 4 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Viaarxiv icon

How Close are Other Computer Vision Tasks to Deepfake Detection?

Add code
Oct 02, 2023
Viaarxiv icon

Spoofing attack augmentation: can differently-trained attack models improve generalisation?

Add code
Sep 18, 2023
Figure 1 for Spoofing attack augmentation: can differently-trained attack models improve generalisation?
Figure 2 for Spoofing attack augmentation: can differently-trained attack models improve generalisation?
Figure 3 for Spoofing attack augmentation: can differently-trained attack models improve generalisation?
Viaarxiv icon

DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input

Add code
Sep 14, 2023
Figure 1 for DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input
Figure 2 for DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input
Figure 3 for DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input
Figure 4 for DDSP-based Neural Waveform Synthesis of Polyphonic Guitar Performance from String-wise MIDI Input
Viaarxiv icon

SynVox2: Towards a privacy-friendly VoxCeleb2 dataset

Add code
Sep 12, 2023
Figure 1 for SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
Figure 2 for SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
Figure 3 for SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
Figure 4 for SynVox2: Towards a privacy-friendly VoxCeleb2 dataset
Viaarxiv icon

Can large-scale vocoded spoofed data improve speech spoofing countermeasure with a self-supervised front end?

Add code
Sep 12, 2023
Viaarxiv icon

Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music

Add code
Jun 15, 2023
Figure 1 for Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music
Figure 2 for Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music
Figure 3 for Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music
Figure 4 for Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music
Viaarxiv icon

Towards single integrated spoofing-aware speaker verification embeddings

Add code
Jun 01, 2023
Figure 1 for Towards single integrated spoofing-aware speaker verification embeddings
Figure 2 for Towards single integrated spoofing-aware speaker verification embeddings
Figure 3 for Towards single integrated spoofing-aware speaker verification embeddings
Figure 4 for Towards single integrated spoofing-aware speaker verification embeddings
Viaarxiv icon

Language-independent speaker anonymization using orthogonal Householder neural network

Add code
May 30, 2023
Figure 1 for Language-independent speaker anonymization using orthogonal Householder neural network
Figure 2 for Language-independent speaker anonymization using orthogonal Householder neural network
Figure 3 for Language-independent speaker anonymization using orthogonal Householder neural network
Figure 4 for Language-independent speaker anonymization using orthogonal Householder neural network
Viaarxiv icon