Picture for Yuki Mitsufuji

Yuki Mitsufuji

Echoes Over Time: Unlocking Length Generalization in Video-to-Audio Generation Models

Add code
Feb 25, 2026
Viaarxiv icon

GUDA: Counterfactual Group-wise Training Data Attribution for Diffusion Models via Unlearning

Add code
Jan 30, 2026
Viaarxiv icon

Summary of The Inaugural Music Source Restoration Challenge

Add code
Jan 07, 2026
Viaarxiv icon

Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

Add code
Jan 03, 2026
Viaarxiv icon

Do Foundational Audio Encoders Understand Music Structure?

Add code
Dec 19, 2025
Viaarxiv icon

AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path

Add code
Dec 15, 2025
Viaarxiv icon

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Add code
Dec 14, 2025
Viaarxiv icon

PAVAS: Physics-Aware Video-to-Audio Synthesis

Add code
Dec 09, 2025
Figure 1 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Figure 2 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Figure 3 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Figure 4 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Viaarxiv icon

Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits

Add code
Dec 08, 2025
Viaarxiv icon

MeanFlow Transformers with Representation Autoencoders

Add code
Nov 17, 2025
Viaarxiv icon