Picture for Shijian Deng

Shijian Deng

AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation

Add code
Jun 11, 2024
Viaarxiv icon

Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation

Add code
Oct 18, 2023
Viaarxiv icon

Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA

Add code
May 31, 2023
Viaarxiv icon