Kyogu Lee

Cross-Modal Bottleneck Fusion For Noise Robust Audio-Visual Speech Recognition

Feb 09, 2026

Rethinking Speech Representation Aggregation in Speech Enhancement: A Phonetic Mutual Information Perspective

Jan 30, 2026

Reverse Engineering of Music Mixing Graphs with Differentiable Processors and Iterative Pruning

Sep 19, 2025

Differentiable Acoustic Radiance Transfer

Sep 19, 2025

MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction

May 29, 2025

DOSE : Drum One-Shot Extraction from Music Mixture

Apr 25, 2025

TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument

Feb 13, 2025

Song Form-aware Full-Song Text-to-Lyrics Generation with Multi-Level Granularity Syllable Count Control

Nov 20, 2024

Do Captioning Metrics Reflect Music Semantic Alignment?

Nov 18, 2024

VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression

Oct 12, 2024