Zhaowei Cai

Mixed-Query Transformer: A Unified Image Segmentation Architecture

Apr 06, 2024
Pei Wang, Zhaowei Cai, Hao Yang, Ashwin Swaminathan, R. Manmatha, Stefano Soatto

Musketeer (All for One, and One for All): A Generalist Vision-Language Model with Task Explanation Prompts

May 11, 2023
Zhaoyang Zhang, Yantao Shen, Kunyu Shi, Zhaowei Cai, Jun Fang, Siqi Deng, Hao Yang, Davide Modolo, Zhuowen Tu, Stefano Soatto

PolyFormer: Referring Image Segmentation as Sequential Polygon Generation

Feb 14, 2023
Jiang Liu, Hui Ding, Zhaowei Cai, Yuting Zhang, Ravi Kumar Satzoda, Vijay Mahadevan, R. Manmatha

Semi-supervised Vision Transformers at Scale

Aug 11, 2022
Zhaowei Cai, Avinash Ravichandran, Paolo Favaro, Manchen Wang, Davide Modolo, Rahul Bhotika, Zhuowen Tu, Stefano Soatto

Masked Vision and Language Modeling for Multi-modal Representation Learning

Aug 03, 2022
Gukyeong Kwon, Zhaowei Cai, Avinash Ravichandran, Erhan Bas, Rahul Bhotika, Stefano Soatto

Rethinking Few-Shot Object Detection on a Multi-Domain Benchmark

Jul 22, 2022
Kibok Lee, Hao Yang, Satyaki Chakraborty, Zhaowei Cai, Gurumurthy Swaminathan, Avinash Ravichandran, Onkar Dabeer

X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks

Apr 12, 2022
Zhaowei Cai, Gukyeong Kwon, Avinash Ravichandran, Erhan Bas, Zhuowen Tu, Rahul Bhotika, Stefano Soatto

Omni-DETR: Omni-Supervised Object Detection with Transformers

Mar 30, 2022
Pei Wang, Zhaowei Cai, Hao Yang, Gurumurthy Swaminathan, Nuno Vasconcelos, Bernt Schiele, Stefano Soatto

Contrastive Neighborhood Alignment

Jan 06, 2022
Pengkai Zhu, Zhaowei Cai, Yuanjun Xiong, Zhuowen Tu, Luis Goncalves, Vijay Mahadevan, Stefano Soatto

Exponential Moving Average Normalization for Self-supervised and Semi-supervised Learning

Jan 21, 2021
Zhaowei Cai, Avinash Ravichandran, Subhransu Maji, Charless Fowlkes, Zhuowen Tu, Stefano Soatto
