Picture for Hanwang Zhang

Hanwang Zhang

Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models

Add code
Aug 26, 2023
Viaarxiv icon

Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition

Add code
Aug 18, 2023
Viaarxiv icon

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions

Add code
Aug 10, 2023
Viaarxiv icon

Random Boxes Are Open-world Object Detectors

Add code
Jul 17, 2023
Viaarxiv icon

DisCo: Disentangled Control for Referring Human Dance Generation in Real World

Add code
Jun 30, 2023
Viaarxiv icon

Fast Diffusion Model

Add code
Jun 12, 2023
Viaarxiv icon

An Overview of Challenges in Egocentric Text-Video Retrieval

Add code
Jun 07, 2023
Viaarxiv icon

Decoupled Kullback-Leibler Divergence Loss

Add code
May 23, 2023
Viaarxiv icon

Equivariant Similarity for Vision-Language Foundation Models

Add code
Mar 25, 2023
Viaarxiv icon

Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

Add code
Mar 22, 2023
Viaarxiv icon