Juhan Nam

D3PIA: A Discrete Denoising Diffusion Model for Piano Accompaniment Generation From Lead sheet

Feb 03, 2026
Via arXiv

UNMIXX: Untangling Highly Correlated Singing Voices Mixtures

Jan 19, 2026
Via arXiv

Twenty-Five Years of MIR Research: Achievements, Practices, Evaluations, and Future Challenges

Nov 10, 2025
Via arXiv

TalkPlay-Tools: Conversational Music Recommendation with LLM Tool Calling

Oct 02, 2025
Via arXiv

Two Web Toolkits for Multimodal Piano Performance Dataset Acquisition and Fingering Annotation

Sep 18, 2025
Via arXiv

PianoVAM: A Multimodal Piano Performance Dataset

Sep 10, 2025
Via arXiv

PianoBind: A Multimodal Joint Embedding Model for Pop-piano Music

Sep 04, 2025
Via arXiv

Can Large Language Models Predict Audio Effects Parameters from Natural Language?

May 27, 2025
Via arXiv

Dialogue in Resonance: An Interactive Music Piece for Piano and Real-Time Automatic Transcription System

May 22, 2025
Via arXiv

KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation

Feb 21, 2025
Via arXiv