Kai Zhu

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Jul 03, 2024
Via arXiv

ViViD: Video Virtual Try-on using Diffusion Models

May 20, 2024
Via arXiv

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior

Apr 17, 2024
Via arXiv

Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation

Mar 22, 2024
Via arXiv

Intention-driven Ego-to-Exo Video Generation

Mar 17, 2024
Via arXiv

CCM: Adding Conditional Controls to Text-to-Image Consistency Models

Dec 12, 2023
Via arXiv

Likelihood-Aware Semantic Alignment for Full-Spectrum Out-of-Distribution Detection

Dec 04, 2023
Via arXiv

Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation

Sep 22, 2023
Via arXiv

Regularized Mask Tuning: Uncovering Hidden Knowledge in Pre-trained Vision-Language Models

Aug 06, 2023
Via arXiv

Eliminating Lipschitz Singularities in Diffusion Models

Jun 20, 2023
Via arXiv