Picture for Kai Han

Kai Han

and Other Contributors

ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks

Add code
Jun 26, 2023
Figure 1 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Figure 2 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Figure 3 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Figure 4 for ParameterNet: Parameters Are All You Need for Large-scale Visual Pretraining of Mobile Networks
Viaarxiv icon

GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?

Add code
Jun 07, 2023
Figure 1 for GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?
Figure 2 for GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?
Figure 3 for GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?
Figure 4 for GPT4Image: Can Large Pre-trained Models Help Vision Models on Perception Tasks?
Viaarxiv icon

HeadSculpt: Crafting 3D Head Avatars with Text

Add code
Jun 05, 2023
Viaarxiv icon

ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation

Add code
Jun 01, 2023
Figure 1 for ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation
Figure 2 for ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation
Figure 3 for ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation
Figure 4 for ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation
Viaarxiv icon

GPT4GEO: How a Language Model Sees the World's Geography

Add code
May 30, 2023
Figure 1 for GPT4GEO: How a Language Model Sees the World's Geography
Figure 2 for GPT4GEO: How a Language Model Sees the World's Geography
Figure 3 for GPT4GEO: How a Language Model Sees the World's Geography
Figure 4 for GPT4GEO: How a Language Model Sees the World's Geography
Viaarxiv icon

VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large Scale

Add code
May 25, 2023
Figure 1 for VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large Scale
Figure 2 for VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large Scale
Figure 3 for VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large Scale
Figure 4 for VanillaKD: Revisit the Power of Vanilla Knowledge Distillation from Small Scale to Large Scale
Viaarxiv icon

Learning Semi-supervised Gaussian Mixture Models for Generalized Category Discovery

Add code
May 10, 2023
Viaarxiv icon

SimSC: A Simple Framework for Semantic Correspondence with Temperature Learning

Add code
May 03, 2023
Viaarxiv icon

SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models

Add code
Apr 23, 2023
Figure 1 for SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models
Figure 2 for SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models
Figure 3 for SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models
Figure 4 for SATIN: A Multi-Task Metadataset for Classifying Satellite Imagery using Vision-Language Models
Viaarxiv icon

CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery

Add code
Apr 14, 2023
Figure 1 for CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery
Figure 2 for CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery
Figure 3 for CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery
Figure 4 for CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery
Viaarxiv icon