Dongjune Lee

Transduce and Speak: Neural Transducer for Text-to-Speech with Semantic Token Prediction

Nov 08, 2023

Fully Unsupervised Training of Few-shot Keyword Spotting

Oct 07, 2022

Disentangled Speaker Representation Learning via Mutual Information Minimization

Aug 17, 2022

Bootstrap Equilibrium and Probabilistic Speaker Representation Learning for Self-supervised Speaker Verification

Dec 24, 2021