Picture for Hanwang Zhang

Hanwang Zhang

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention

Add code
Sep 17, 2023
Figure 1 for Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention
Figure 2 for Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention
Figure 3 for Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention
Figure 4 for Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention
Viaarxiv icon

Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models

Add code
Aug 26, 2023
Viaarxiv icon

Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition

Add code
Aug 18, 2023
Viaarxiv icon

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions

Add code
Aug 10, 2023
Figure 1 for Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions
Figure 2 for Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions
Figure 3 for Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions
Figure 4 for Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions
Viaarxiv icon

Random Boxes Are Open-world Object Detectors

Add code
Jul 17, 2023
Figure 1 for Random Boxes Are Open-world Object Detectors
Figure 2 for Random Boxes Are Open-world Object Detectors
Figure 3 for Random Boxes Are Open-world Object Detectors
Figure 4 for Random Boxes Are Open-world Object Detectors
Viaarxiv icon

DisCo: Disentangled Control for Referring Human Dance Generation in Real World

Add code
Jun 30, 2023
Viaarxiv icon

Fast Diffusion Model

Add code
Jun 12, 2023
Figure 1 for Fast Diffusion Model
Figure 2 for Fast Diffusion Model
Figure 3 for Fast Diffusion Model
Figure 4 for Fast Diffusion Model
Viaarxiv icon

An Overview of Challenges in Egocentric Text-Video Retrieval

Add code
Jun 07, 2023
Figure 1 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 2 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 3 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 4 for An Overview of Challenges in Egocentric Text-Video Retrieval
Viaarxiv icon

Decoupled Kullback-Leibler Divergence Loss

Add code
May 23, 2023
Viaarxiv icon

Equivariant Similarity for Vision-Language Foundation Models

Add code
Mar 25, 2023
Viaarxiv icon