Minje Kim

HEART-PFL: Stable Personalized Federated Learning under Heterogeneity with Hierarchical Directional Alignment and Adversarial Knowledge Transfer

Mar 25, 2026

Something from Nothing: Data Augmentation for Robust Severity Level Estimation of Dysarthric Speech

Mar 16, 2026

Gencho: Room Impulse Response Generation from Reverberant Speech and Text via Diffusion Transformers

Feb 09, 2026

From Hallucination to Articulation: Language Model-Driven Losses for Ultra Low-Bitrate Neural Speech Coding

Feb 05, 2026

Semantics-Aware Generative Latent Data Augmentation for Learning in Low-Resource Domains

Feb 02, 2026

PromptSep: Generative Audio Separation via Multimodal Prompting

Nov 06, 2025

User-guided Generative Source Separation

Jul 02, 2025

Discrete Audio Tokens: More Than a Survey!

Jun 12, 2025

Perceptual Audio Coding: A 40-Year Historical Perspective

Apr 22, 2025

DTA: Dual Temporal-channel-wise Attention for Spiking Neural Networks

Mar 13, 2025