Picture for Dongdong Chen

Dongdong Chen

Look Before You Match: Instance Understanding Matters in Video Object Segmentation

Add code
Dec 13, 2022
Figure 1 for Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Figure 2 for Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Figure 3 for Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Figure 4 for Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Viaarxiv icon

CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet

Add code
Dec 12, 2022
Figure 1 for CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
Figure 2 for CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
Figure 3 for CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
Figure 4 for CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet
Viaarxiv icon

Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning

Add code
Dec 08, 2022
Viaarxiv icon

X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion

Add code
Dec 07, 2022
Figure 1 for X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion
Figure 2 for X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion
Figure 3 for X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion
Figure 4 for X-Paste: Revisit Copy-Paste at Scale with CLIP and StableDiffusion
Viaarxiv icon

Robust Point Cloud Segmentation with Noisy Annotations

Add code
Dec 06, 2022
Figure 1 for Robust Point Cloud Segmentation with Noisy Annotations
Figure 2 for Robust Point Cloud Segmentation with Noisy Annotations
Figure 3 for Robust Point Cloud Segmentation with Noisy Annotations
Figure 4 for Robust Point Cloud Segmentation with Noisy Annotations
Viaarxiv icon

Improving Commonsense in Vision-Language Models via Knowledge Graph Riddles

Add code
Nov 29, 2022
Viaarxiv icon

Self-Supervised Learning based on Heat Equation

Add code
Nov 23, 2022
Viaarxiv icon

SinDiffusion: Learning a Diffusion Model from a Single Natural Image

Add code
Nov 22, 2022
Viaarxiv icon

PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition

Add code
Sep 16, 2022
Figure 1 for PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition
Figure 2 for PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition
Figure 3 for PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition
Figure 4 for PointCAT: Contrastive Adversarial Training for Robust Point Cloud Recognition
Viaarxiv icon

OmniVL:One Foundation Model for Image-Language and Video-Language Tasks

Add code
Sep 15, 2022
Figure 1 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 2 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 3 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Figure 4 for OmniVL:One Foundation Model for Image-Language and Video-Language Tasks
Viaarxiv icon