Tomoki Hayashi

DiscreTalk: Text-to-Speech as a Machine Translation Problem

May 12, 2020

ESPnet-ST: All-in-One Speech Translation Toolkit

Apr 21, 2020

End-to-End Automatic Speech Recognition Integrated With CTC-Based Voice Activity Detection

Feb 14, 2020

Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining

Dec 14, 2019

ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit

Oct 24, 2019

A Comparative Study on Transformer vs RNN in Speech Applications

Sep 28, 2019

Non-Parallel Voice Conversion with Cyclic Variational Autoencoder

Jul 24, 2019

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion

Nov 27, 2018

Cycle-consistency training for end-to-end speech recognition

Nov 02, 2018

Back-Translation-Style Data Augmentation for End-to-End ASR

Jul 28, 2018