Picture for Takashi Shibuya

Takashi Shibuya

SONA: Learning Conditional, Unconditional, and Mismatching-Aware Discriminator

Add code
Oct 06, 2025
Viaarxiv icon

SoundReactor: Frame-level Online Video-to-Audio Generation

Add code
Oct 02, 2025
Viaarxiv icon

TITAN-Guide: Taming Inference-Time AligNment for Guided Text-to-Video Diffusion Models

Add code
Aug 01, 2025
Viaarxiv icon

Step-by-Step Video-to-Audio Synthesis via Negative Audio Guidance

Add code
Jun 26, 2025
Viaarxiv icon

Vid-CamEdit: Video Camera Trajectory Editing with Generative Rendering from Estimated Geometry

Add code
Jun 16, 2025
Viaarxiv icon

Dyadic Mamba: Long-term Dyadic Human Motion Synthesis

Add code
May 14, 2025
Viaarxiv icon

Forging and Removing Latent-Noise Diffusion Watermarks Using a Single Image

Add code
Apr 27, 2025
Viaarxiv icon

HumanGif: Single-View Human Diffusion with Generative Prior

Add code
Feb 17, 2025
Figure 1 for HumanGif: Single-View Human Diffusion with Generative Prior
Figure 2 for HumanGif: Single-View Human Diffusion with Generative Prior
Figure 3 for HumanGif: Single-View Human Diffusion with Generative Prior
Figure 4 for HumanGif: Single-View Human Diffusion with Generative Prior
Viaarxiv icon

CCStereo: Audio-Visual Contextual and Contrastive Learning for Binaural Audio Generation

Add code
Jan 06, 2025
Viaarxiv icon

Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

Add code
Dec 19, 2024
Viaarxiv icon