Juhan Nam

TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling

Oct 02, 2025
Via arXiv

Two Web Toolkits for Multimodal Piano Performance Dataset Acquisition and Fingering Annotation

Sep 18, 2025
Via arXiv

PianoVAM: A Multimodal Piano Performance Dataset

Sep 10, 2025
Via arXiv

PianoBind: A Multimodal Joint Embedding Model for Pop-piano Music

Sep 04, 2025
Via arXiv

Can Large Language Models Predict Audio Effects Parameters from Natural Language?

May 27, 2025
Via arXiv

Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System

May 22, 2025
Via arXiv

KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation

Feb 21, 2025
Via arXiv

TALKPLAY: Multimodal Music Recommendation with Large Language Models

Feb 20, 2025
Via arXiv

FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation

Jan 18, 2025
Via arXiv

D3RM: A Discrete Denoising Diffusion Refinement Model for Piano Transcription

Jan 09, 2025
Via arXiv