Picture for Eesung Kim

Eesung Kim

Peeking Into The Future For Contextual Biasing

Add code
Dec 19, 2025
Viaarxiv icon

Enhanced Hybrid Transducer and Attention Encoder Decoder with Text Data

Add code
Jun 23, 2025
Viaarxiv icon

Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning

Add code
Apr 08, 2022
Figure 1 for Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning
Figure 2 for Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning
Figure 3 for Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning
Figure 4 for Automatic Pronunciation Assessment using Self-Supervised Speech Representation Learning
Viaarxiv icon

JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech

Add code
Mar 31, 2022
Figure 1 for JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
Figure 2 for JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
Figure 3 for JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
Viaarxiv icon

Multitask Learning and Joint Optimization for Transformer-RNN-Transducer Speech Recognition

Add code
Nov 02, 2020
Figure 1 for Multitask Learning and Joint Optimization for Transformer-RNN-Transducer Speech Recognition
Figure 2 for Multitask Learning and Joint Optimization for Transformer-RNN-Transducer Speech Recognition
Figure 3 for Multitask Learning and Joint Optimization for Transformer-RNN-Transducer Speech Recognition
Figure 4 for Multitask Learning and Joint Optimization for Transformer-RNN-Transducer Speech Recognition
Viaarxiv icon