Junghyun Koo

Break-the-Beat! Controllable MIDI-to-Drum Audio Synthesis

May 14, 2026
Via arXiv

Automatic Music Mixing using a Generative Model of Effect Embeddings

Nov 11, 2025
Via arXiv

Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution

Jul 09, 2025
Via arXiv

Fx-Encoder++: Extracting Instrument-Wise Audio Effects Representations from Mixtures

Jul 03, 2025
Via arXiv

Can Large Language Models Predict Audio Effects Parameters from Natural Language?

May 27, 2025
Via arXiv

Improving Inference-Time Optimisation for Vocal Effects Style Transfer with a Gaussian Prior

May 16, 2025
Via arXiv

DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions

Apr 20, 2025
Via arXiv

TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument

Feb 13, 2025
Via arXiv

VRVQ: Variable Bitrate Residual Vector Quantization for Audio Compression

Oct 12, 2024
Via arXiv

Variable Bitrate Residual Vector Quantization for Audio Coding

Oct 08, 2024
Via arXiv