Tomoki Toda

Investigation of perceptual music similarity focusing on each instrumental part

Feb 04, 2025

Wavehax: Aliasing-Free Neural Waveform Synthesis Based on 2D Convolution and Harmonic Prior for Reliable Complex Spectrogram Estimation

Nov 11, 2024

MOS-Bench: Benchmarking Generalization Abilities of Subjective Speech Quality Assessment Models

Nov 06, 2024

Two-stage Framework for Robust Speech Emotion Recognition Using Target Speaker Extraction in Human Speech Noise Conditions

Sep 29, 2024

Improved Architecture for High-resolution Piano Transcription to Efficiently Capture Acoustic Characteristics of Music Signals

Sep 29, 2024

Improvements of Discriminative Feature Space Training for Anomalous Sound Detection in Unlabeled Conditions

Sep 14, 2024

The VoiceMOS Challenge 2024: Beyond Speech Quality Prediction

Sep 11, 2024

SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge

Aug 28, 2024

2DP-2MRC: 2-Dimensional Pointer-based Machine Reading Comprehension Method for Multimodal Moment Retrieval

Jun 10, 2024

Quantifying the effect of speech pathology on automatic and human speaker verification

Jun 10, 2024