Picture for Hanwang Zhang

Hanwang Zhang

Fast Diffusion Model

Add code
Jun 12, 2023
Figure 1 for Fast Diffusion Model
Figure 2 for Fast Diffusion Model
Figure 3 for Fast Diffusion Model
Figure 4 for Fast Diffusion Model
Viaarxiv icon

An Overview of Challenges in Egocentric Text-Video Retrieval

Add code
Jun 07, 2023
Figure 1 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 2 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 3 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 4 for An Overview of Challenges in Egocentric Text-Video Retrieval
Viaarxiv icon

Decoupled Kullback-Leibler Divergence Loss

Add code
May 23, 2023
Viaarxiv icon

Equivariant Similarity for Vision-Language Foundation Models

Add code
Mar 25, 2023
Viaarxiv icon

Unbiased Multiple Instance Learning for Weakly Supervised Video Anomaly Detection

Add code
Mar 22, 2023
Viaarxiv icon

Semantic Scene Completion with Cleaner Self

Add code
Mar 17, 2023
Figure 1 for Semantic Scene Completion with Cleaner Self
Figure 2 for Semantic Scene Completion with Cleaner Self
Figure 3 for Semantic Scene Completion with Cleaner Self
Figure 4 for Semantic Scene Completion with Cleaner Self
Viaarxiv icon

Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection

Add code
Feb 01, 2023
Figure 1 for Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
Figure 2 for Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
Figure 3 for Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
Figure 4 for Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
Viaarxiv icon

Debiased Fine-Tuning for Vision-language Models by Prompt Regularization

Add code
Jan 29, 2023
Viaarxiv icon

Learning Trajectory-Word Alignments for Video-Language Tasks

Add code
Jan 06, 2023
Figure 1 for Learning Trajectory-Word Alignments for Video-Language Tasks
Figure 2 for Learning Trajectory-Word Alignments for Video-Language Tasks
Figure 3 for Learning Trajectory-Word Alignments for Video-Language Tasks
Figure 4 for Learning Trajectory-Word Alignments for Video-Language Tasks
Viaarxiv icon

Adaptively Clustering Neighbor Elements for Image Captioning

Add code
Jan 05, 2023
Viaarxiv icon