Picture for Xiaohu Jiang

Xiaohu Jiang

Supervised Fine-tuning in turn Improves Visual Foundation Models

Add code
Jan 18, 2024
Figure 1 for Supervised Fine-tuning in turn Improves Visual Foundation Models
Figure 2 for Supervised Fine-tuning in turn Improves Visual Foundation Models
Figure 3 for Supervised Fine-tuning in turn Improves Visual Foundation Models
Figure 4 for Supervised Fine-tuning in turn Improves Visual Foundation Models
Viaarxiv icon

Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks

Add code
Nov 17, 2022
Figure 1 for Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Figure 2 for Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Figure 3 for Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Figure 4 for Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks
Viaarxiv icon

Focal and Global Knowledge Distillation for Detectors

Add code
Nov 23, 2021
Figure 1 for Focal and Global Knowledge Distillation for Detectors
Figure 2 for Focal and Global Knowledge Distillation for Detectors
Figure 3 for Focal and Global Knowledge Distillation for Detectors
Figure 4 for Focal and Global Knowledge Distillation for Detectors
Viaarxiv icon

Guiding Query Position and Performing Similar Attention for Transformer-Based Detection Heads

Add code
Aug 22, 2021
Figure 1 for Guiding Query Position and Performing Similar Attention for Transformer-Based Detection Heads
Figure 2 for Guiding Query Position and Performing Similar Attention for Transformer-Based Detection Heads
Figure 3 for Guiding Query Position and Performing Similar Attention for Transformer-Based Detection Heads
Figure 4 for Guiding Query Position and Performing Similar Attention for Transformer-Based Detection Heads
Viaarxiv icon