Picture for Yuki Mitsufuji

Yuki Mitsufuji

Summary of The Inaugural Music Source Restoration Challenge

Add code
Jan 07, 2026
Viaarxiv icon

Improved Object-Centric Diffusion Learning with Registers and Contrastive Alignment

Add code
Jan 03, 2026
Viaarxiv icon

Do Foundational Audio Encoders Understand Music Structure?

Add code
Dec 19, 2025
Viaarxiv icon

AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path

Add code
Dec 15, 2025
Viaarxiv icon

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Add code
Dec 14, 2025
Viaarxiv icon

PAVAS: Physics-Aware Video-to-Audio Synthesis

Add code
Dec 09, 2025
Figure 1 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Figure 2 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Figure 3 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Figure 4 for PAVAS: Physics-Aware Video-to-Audio Synthesis
Viaarxiv icon

Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits

Add code
Dec 08, 2025
Viaarxiv icon

FoleyBench: A Benchmark For Video-to-Audio Models

Add code
Nov 17, 2025
Figure 1 for FoleyBench: A Benchmark For Video-to-Audio Models
Figure 2 for FoleyBench: A Benchmark For Video-to-Audio Models
Figure 3 for FoleyBench: A Benchmark For Video-to-Audio Models
Figure 4 for FoleyBench: A Benchmark For Video-to-Audio Models
Viaarxiv icon

MeanFlow Transformers with Representation Autoencoders

Add code
Nov 17, 2025
Viaarxiv icon

Automatic Music Mixing using a Generative Model of Effect Embeddings

Add code
Nov 11, 2025
Viaarxiv icon