Picture for Youngmoon Jung

Youngmoon Jung

Triage knowledge distillation for speaker verification

Add code
Jan 21, 2026
Viaarxiv icon

MATE: Matryoshka Audio-Text Embeddings for Open-Vocabulary Keyword Spotting

Add code
Jan 20, 2026
Viaarxiv icon

DAME: Duration-Aware Matryoshka Embedding for Duration-Robust Speaker Verification

Add code
Jan 20, 2026
Viaarxiv icon

Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting

Add code
May 22, 2025
Figure 1 for Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting
Figure 2 for Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting
Figure 3 for Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting
Figure 4 for Adversarial Deep Metric Learning for Cross-Modal Audio-Text Alignment in Open-Vocabulary Keyword Spotting
Viaarxiv icon

Text-Aware Adapter for Few-Shot Keyword Spotting

Add code
Dec 24, 2024
Figure 1 for Text-Aware Adapter for Few-Shot Keyword Spotting
Figure 2 for Text-Aware Adapter for Few-Shot Keyword Spotting
Figure 3 for Text-Aware Adapter for Few-Shot Keyword Spotting
Figure 4 for Text-Aware Adapter for Few-Shot Keyword Spotting
Viaarxiv icon

CTC-aligned Audio-Text Embedding for Streaming Open-vocabulary Keyword Spotting

Add code
Jun 12, 2024
Viaarxiv icon

Relational Proxy Loss for Audio-Text based Keyword Spotting

Add code
Jun 08, 2024
Figure 1 for Relational Proxy Loss for Audio-Text based Keyword Spotting
Figure 2 for Relational Proxy Loss for Audio-Text based Keyword Spotting
Figure 3 for Relational Proxy Loss for Audio-Text based Keyword Spotting
Figure 4 for Relational Proxy Loss for Audio-Text based Keyword Spotting
Viaarxiv icon

FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning

Add code
Jul 01, 2022
Figure 1 for FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning
Figure 2 for FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning
Figure 3 for FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning
Figure 4 for FitHuBERT: Going Thinner and Deeper for Knowledge Distillation of Speech Self-Supervised Learning
Viaarxiv icon

Perceptually Guided End-to-End Text-to-Speech

Add code
Nov 02, 2020
Figure 1 for Perceptually Guided End-to-End Text-to-Speech
Figure 2 for Perceptually Guided End-to-End Text-to-Speech
Figure 3 for Perceptually Guided End-to-End Text-to-Speech
Viaarxiv icon

A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments

Add code
Oct 06, 2020
Figure 1 for A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Figure 2 for A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Figure 3 for A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Figure 4 for A Unified Deep Learning Framework for Short-Duration Speaker Verification in Adverse Environments
Viaarxiv icon