Hanwang Zhang

Invariant Feature Regularization for Fair Face Recognition

Oct 23, 2023
Via arXiv

Generalized Logit Adjustment: Calibrating Fine-tuned Models by Removing Label Bias in Foundation Models

Oct 12, 2023
Via arXiv

Tuning Multi-mode Token-level Prompt Alignment across Modalities

Sep 25, 2023
Via arXiv

Make the U in UDA Matter: Invariant Consistency Learning for Unsupervised Domain Adaptation

Sep 22, 2023
Via arXiv

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention

Sep 17, 2023
Via arXiv

Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models

Aug 26, 2023
Via arXiv

Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition

Aug 18, 2023
Via arXiv

Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions

Aug 10, 2023
Via arXiv

Random Boxes Are Open-world Object Detectors

Jul 17, 2023
Via arXiv

DisCo: Disentangled Control for Referring Human Dance Generation in Real World

Jun 30, 2023
Via arXiv