Picture for Juhan Nam

Juhan Nam

PianoVAM: A Multimodal Piano Performance Dataset

Add code
Sep 10, 2025
Viaarxiv icon

PianoBind: A Multimodal Joint Embedding Model for Pop-piano Music

Add code
Sep 04, 2025
Viaarxiv icon

Can Large Language Models Predict Audio Effects Parameters from Natural Language?

Add code
May 27, 2025
Viaarxiv icon

Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System

Add code
May 22, 2025
Viaarxiv icon

KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation

Add code
Feb 21, 2025
Viaarxiv icon

TALKPLAY: Multimodal Music Recommendation with Large Language Models

Add code
Feb 20, 2025
Viaarxiv icon

FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation

Add code
Jan 18, 2025
Viaarxiv icon

D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription

Add code
Jan 09, 2025
Viaarxiv icon

Predicting User Intents and Musical Attributes from Music Discovery Conversations

Add code
Nov 20, 2024
Viaarxiv icon

Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models

Add code
Nov 11, 2024
Viaarxiv icon