Yuki Mitsufuji

Understanding and Accelerating the Training of Masked Diffusion Language Models

May 13, 2026

Woosh: A Sound Effects Foundation Model

Apr 02, 2026

Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models

Feb 25, 2026

GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning

Jan 30, 2026

Summary of The Inaugural Music Source Restoration Challenge

Jan 07, 2026

Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

Jan 03, 2026

Do Foundational Audio Encoders Understand Music Structure?

Dec 19, 2025

AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path

Dec 15, 2025

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Dec 14, 2025

PAVAS: Physics-Aware Video-to-Audio Synthesis

Dec 09, 2025