Picture for Zhuowen Tu

Zhuowen Tu

Open-Vocabulary Panoptic Segmentation with MaskCLIP

Add code
Aug 18, 2022
Figure 1 for Open-Vocabulary Panoptic Segmentation with MaskCLIP
Figure 2 for Open-Vocabulary Panoptic Segmentation with MaskCLIP
Figure 3 for Open-Vocabulary Panoptic Segmentation with MaskCLIP
Figure 4 for Open-Vocabulary Panoptic Segmentation with MaskCLIP
Viaarxiv icon

Semi-supervised Vision Transformers at Scale

Add code
Aug 11, 2022
Figure 1 for Semi-supervised Vision Transformers at Scale
Figure 2 for Semi-supervised Vision Transformers at Scale
Figure 3 for Semi-supervised Vision Transformers at Scale
Figure 4 for Semi-supervised Vision Transformers at Scale
Viaarxiv icon

The Geometry of Multilingual Language Model Representations

Add code
May 22, 2022
Figure 1 for The Geometry of Multilingual Language Model Representations
Figure 2 for The Geometry of Multilingual Language Model Representations
Figure 3 for The Geometry of Multilingual Language Model Representations
Figure 4 for The Geometry of Multilingual Language Model Representations
Viaarxiv icon

ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training

Add code
May 13, 2022
Figure 1 for ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training
Figure 2 for ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training
Figure 3 for ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training
Figure 4 for ELODI: Ensemble Logit Difference Inhibition for Positive-Congruent Training
Viaarxiv icon

X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks

Add code
Apr 12, 2022
Figure 1 for X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks
Figure 2 for X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks
Figure 3 for X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks
Figure 4 for X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks
Viaarxiv icon

Text Spotting Transformers

Add code
Apr 05, 2022
Figure 1 for Text Spotting Transformers
Figure 2 for Text Spotting Transformers
Figure 3 for Text Spotting Transformers
Figure 4 for Text Spotting Transformers
Viaarxiv icon

MeMOT: Multi-Object Tracking with Memory

Add code
Mar 31, 2022
Figure 1 for MeMOT: Multi-Object Tracking with Memory
Figure 2 for MeMOT: Multi-Object Tracking with Memory
Figure 3 for MeMOT: Multi-Object Tracking with Memory
Figure 4 for MeMOT: Multi-Object Tracking with Memory
Viaarxiv icon

Contrastive Neighborhood Alignment

Add code
Jan 06, 2022
Figure 1 for Contrastive Neighborhood Alignment
Figure 2 for Contrastive Neighborhood Alignment
Figure 3 for Contrastive Neighborhood Alignment
Figure 4 for Contrastive Neighborhood Alignment
Viaarxiv icon

Towards Panoptic 3D Parsing for Single Image in the Wild

Add code
Nov 29, 2021
Figure 1 for Towards Panoptic 3D Parsing for Single Image in the Wild
Figure 2 for Towards Panoptic 3D Parsing for Single Image in the Wild
Figure 3 for Towards Panoptic 3D Parsing for Single Image in the Wild
Figure 4 for Towards Panoptic 3D Parsing for Single Image in the Wild
Viaarxiv icon

ViTGAN: Training GANs with Vision Transformers

Add code
Jul 09, 2021
Figure 1 for ViTGAN: Training GANs with Vision Transformers
Figure 2 for ViTGAN: Training GANs with Vision Transformers
Figure 3 for ViTGAN: Training GANs with Vision Transformers
Figure 4 for ViTGAN: Training GANs with Vision Transformers
Viaarxiv icon