Picture for Hanwang Zhang

Hanwang Zhang

Semantic Scene Completion with Cleaner Self

Add code
Mar 17, 2023
Figure 1 for Semantic Scene Completion with Cleaner Self
Figure 2 for Semantic Scene Completion with Cleaner Self
Figure 3 for Semantic Scene Completion with Cleaner Self
Figure 4 for Semantic Scene Completion with Cleaner Self
Viaarxiv icon

Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection

Add code
Feb 01, 2023
Figure 1 for Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
Figure 2 for Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
Figure 3 for Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
Figure 4 for Compositional Prompt Tuning with Motion Cues for Open-vocabulary Video Relation Detection
Viaarxiv icon

Debiased Fine-Tuning for Vision-language Models by Prompt Regularization

Add code
Jan 29, 2023
Viaarxiv icon

Learning Trajectory-Word Alignments for Video-Language Tasks

Add code
Jan 06, 2023
Figure 1 for Learning Trajectory-Word Alignments for Video-Language Tasks
Figure 2 for Learning Trajectory-Word Alignments for Video-Language Tasks
Figure 3 for Learning Trajectory-Word Alignments for Video-Language Tasks
Figure 4 for Learning Trajectory-Word Alignments for Video-Language Tasks
Viaarxiv icon

Adaptively Clustering Neighbor Elements for Image Captioning

Add code
Jan 05, 2023
Viaarxiv icon

Evaluating and Mitigating Static Bias of Action Representations in the Background and the Foreground

Add code
Nov 23, 2022
Viaarxiv icon

Attention-based Class Activation Diffusion for Weakly-Supervised Semantic Segmentation

Add code
Nov 20, 2022
Viaarxiv icon

Respecting Transfer Gap in Knowledge Distillation

Add code
Oct 23, 2022
Viaarxiv icon

Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning

Add code
Oct 04, 2022
Figure 1 for Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Figure 2 for Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Figure 3 for Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Figure 4 for Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning
Viaarxiv icon

Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization

Add code
Aug 06, 2022
Figure 1 for Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization
Figure 2 for Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization
Figure 3 for Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization
Figure 4 for Class Is Invariant to Context and Vice Versa: On Learning Invariance for Out-Of-Distribution Generalization
Viaarxiv icon