Picture for Tomoki Toda

Tomoki Toda

CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection

Add code
Jun 04, 2024
Figure 1 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 2 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 3 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Figure 4 for CtrSVDD: A Benchmark Dataset and Baseline Analysis for Controlled Singing Voice Deepfake Detection
Viaarxiv icon

Multi-speaker Text-to-speech Training with Speaker Anonymized Data

Add code
May 20, 2024
Viaarxiv icon

SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan

Add code
May 08, 2024
Figure 1 for SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan
Figure 2 for SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan
Figure 3 for SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan
Figure 4 for SVDD Challenge 2024: A Singing Voice Deepfake Detection Challenge Evaluation Plan
Viaarxiv icon

Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment

Add code
Apr 10, 2024
Figure 1 for Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment
Figure 2 for Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment
Figure 3 for Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment
Figure 4 for Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment
Viaarxiv icon

Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection

Add code
Mar 18, 2024
Figure 1 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Figure 2 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Figure 3 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Figure 4 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Viaarxiv icon

Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment

Add code
Mar 10, 2024
Figure 1 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 2 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 3 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 4 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Viaarxiv icon

MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, ASR Error Detection, and ASR Error Correction

Add code
Jan 24, 2024
Viaarxiv icon

On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition

Add code
Nov 14, 2023
Viaarxiv icon

ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction

Add code
Oct 08, 2023
Figure 1 for ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction
Figure 2 for ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction
Figure 3 for ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction
Figure 4 for ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction
Viaarxiv icon

A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023

Add code
Oct 08, 2023
Figure 1 for A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
Figure 2 for A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
Figure 3 for A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
Figure 4 for A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
Viaarxiv icon