Picture for Takashi Shibuya

Takashi Shibuya

AutoRefiner: Improving Autoregressive Video Diffusion Models via Reflective Refinement Over the Stochastic Sampling Path

Add code
Dec 15, 2025
Viaarxiv icon

Schrodinger Audio-Visual Editor: Object-Level Audiovisual Removal

Add code
Dec 14, 2025
Viaarxiv icon

Coherent Audio-Visual Editing via Conditional Audio Generation Following Video Edits

Add code
Dec 08, 2025
Viaarxiv icon

SONA: Learning Conditional, Unconditional, and Mismatching-Aware Discriminator

Add code
Oct 06, 2025
Viaarxiv icon

SoundReactor: Frame-level Online Video-to-Audio Generation

Add code
Oct 02, 2025
Figure 1 for SoundReactor: Frame-level Online Video-to-Audio Generation
Figure 2 for SoundReactor: Frame-level Online Video-to-Audio Generation
Figure 3 for SoundReactor: Frame-level Online Video-to-Audio Generation
Figure 4 for SoundReactor: Frame-level Online Video-to-Audio Generation
Viaarxiv icon

TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models

Add code
Aug 01, 2025
Viaarxiv icon

Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance

Add code
Jun 26, 2025
Viaarxiv icon

Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry

Add code
Jun 16, 2025
Viaarxiv icon

Dyadic Mamba: Long-term Dyadic Human Motion Synthesis

Add code
May 14, 2025
Viaarxiv icon

Forging and Removing Latent-Noise Diffusion Watermarks Using a Single Image

Add code
Apr 27, 2025
Viaarxiv icon