Yuki Mitsufuji

Concept-TRAK: Understanding how diffusion models learn concepts through concept-level attribution

Jul 09, 2025
Via arXiv

Denoising Multi-Beta VAE: Representation Learning for Disentanglement and Generation

Jul 09, 2025
Via arXiv

Fx-Encoder++: Extracting Instrument-Wise Audio Effects Representations from Mixtures

Jul 03, 2025
Via arXiv

Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance

Jun 26, 2025
Via arXiv

Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry

Jun 16, 2025
Via arXiv

Can Large Language Models Predict Audio Effects Parameters from Natural Language?

May 27, 2025
Via arXiv

A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?

May 26, 2025
Via arXiv

SpecMaskFoley: Steering Pretrained Spectral Masked Generative Transformer Toward Synchronized Video-to-audio Synthesis via ControlNet

May 22, 2025
Via arXiv

Improving Inference-Time Optimisation for Vocal Effects Style Transfer with a Gaussian Prior

May 16, 2025
Via arXiv

Dyadic Mamba: Long-term Dyadic Human Motion Synthesis

May 14, 2025
Via arXiv