Picture for Nobuaki Minematsu

Nobuaki Minematsu

A Pilot Study of GSLM-based Simulation of Foreign Accentuation Only Using Native Speech Corpora

Add code
Jul 16, 2024
Viaarxiv icon

Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music

Add code
Jun 15, 2023
Figure 1 for Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music
Figure 2 for Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music
Figure 3 for Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music
Figure 4 for Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music
Viaarxiv icon

Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition

Add code
Apr 08, 2022
Figure 1 for Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
Figure 2 for Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
Figure 3 for Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
Figure 4 for Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition
Viaarxiv icon

Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder

Add code
Jul 31, 2018
Figure 1 for Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder
Figure 2 for Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder
Figure 3 for Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder
Figure 4 for Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder
Viaarxiv icon