Alert button
Picture for Xiyang Dai

Xiyang Dai

Alert button

OmniTracker: Unifying Object Tracking by Tracking-with-Detection

Add code
Bookmark button
Alert button
Mar 21, 2023
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Xiyang Dai, Lu Yuan, Yu-Gang Jiang

Figure 1 for OmniTracker: Unifying Object Tracking by Tracking-with-Detection
Figure 2 for OmniTracker: Unifying Object Tracking by Tracking-with-Detection
Figure 3 for OmniTracker: Unifying Object Tracking by Tracking-with-Detection
Figure 4 for OmniTracker: Unifying Object Tracking by Tracking-with-Detection
Viaarxiv icon

Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations

Add code
Bookmark button
Alert button
Feb 27, 2023
Ziyu Jiang, Yinpeng Chen, Mengchen Liu, Dongdong Chen, Xiyang Dai, Lu Yuan, Zicheng Liu, Zhangyang Wang

Figure 1 for Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations
Figure 2 for Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations
Figure 3 for Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations
Figure 4 for Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Label-Efficient Representations
Viaarxiv icon

Generalized Decoding for Pixel, Image, and Language

Add code
Bookmark button
Alert button
Dec 21, 2022
Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao

Figure 1 for Generalized Decoding for Pixel, Image, and Language
Figure 2 for Generalized Decoding for Pixel, Image, and Language
Figure 3 for Generalized Decoding for Pixel, Image, and Language
Figure 4 for Generalized Decoding for Pixel, Image, and Language
Viaarxiv icon

Look Before You Match: Instance Understanding Matters in Video Object Segmentation

Add code
Bookmark button
Alert button
Dec 13, 2022
Junke Wang, Dongdong Chen, Zuxuan Wu, Chong Luo, Chuanxin Tang, Xiyang Dai, Yucheng Zhao, Yujia Xie, Lu Yuan, Yu-Gang Jiang

Figure 1 for Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Figure 2 for Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Figure 3 for Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Figure 4 for Look Before You Match: Instance Understanding Matters in Video Object Segmentation
Viaarxiv icon

Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning

Add code
Bookmark button
Alert button
Dec 08, 2022
Rui Wang, Dongdong Chen, Zuxuan Wu, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Lu Yuan, Yu-Gang Jiang

Figure 1 for Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Figure 2 for Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Figure 3 for Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Figure 4 for Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning
Viaarxiv icon

Self-Supervised Learning based on Heat Equation

Add code
Bookmark button
Alert button
Nov 23, 2022
Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Lu Yuan, Zicheng Liu, Youzuo Lin

Figure 1 for Self-Supervised Learning based on Heat Equation
Figure 2 for Self-Supervised Learning based on Heat Equation
Figure 3 for Self-Supervised Learning based on Heat Equation
Figure 4 for Self-Supervised Learning based on Heat Equation
Viaarxiv icon

Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling

Add code
Bookmark button
Alert button
Aug 25, 2022
Rui Wang, Zuxuan Wu, Dongdong Chen, Yinpeng Chen, Xiyang Dai, Mengchen Liu, Luowei Zhou, Lu Yuan, Yu-Gang Jiang

Figure 1 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Figure 2 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Figure 3 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Figure 4 for Video Mobile-Former: Video Recognition with Efficient Global Spatial-temporal Modeling
Viaarxiv icon

Should All Proposals be Treated Equally in Object Detection?

Add code
Bookmark button
Alert button
Jul 07, 2022
Yunsheng Li, Yinpeng Chen, Xiyang Dai, Dongdong Chen, Mengchen Liu, Pei Yu, Jing Yin, Lu Yuan, Zicheng Liu, Nuno Vasconcelos

Figure 1 for Should All Proposals be Treated Equally in Object Detection?
Figure 2 for Should All Proposals be Treated Equally in Object Detection?
Figure 3 for Should All Proposals be Treated Equally in Object Detection?
Figure 4 for Should All Proposals be Treated Equally in Object Detection?
Viaarxiv icon

GLIPv2: Unifying Localization and Vision-Language Understanding

Add code
Bookmark button
Alert button
Jun 12, 2022
Haotian Zhang, Pengchuan Zhang, Xiaowei Hu, Yen-Chun Chen, Liunian Harold Li, Xiyang Dai, Lijuan Wang, Lu Yuan, Jenq-Neng Hwang, Jianfeng Gao

Figure 1 for GLIPv2: Unifying Localization and Vision-Language Understanding
Figure 2 for GLIPv2: Unifying Localization and Vision-Language Understanding
Figure 3 for GLIPv2: Unifying Localization and Vision-Language Understanding
Figure 4 for GLIPv2: Unifying Localization and Vision-Language Understanding
Viaarxiv icon

Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding

Add code
Bookmark button
Alert button
Jun 07, 2022
Lingchen Meng, Xiyang Dai, Yinpeng Chen, Pengchuan Zhang, Dongdong Chen, Mengchen Liu, Jianfeng Wang, Zuxuan Wu, Lu Yuan, Yu-Gang Jiang

Figure 1 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Figure 2 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Figure 3 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Figure 4 for Detection Hub: Unifying Object Detection Datasets via Query Adaptation on Language Embedding
Viaarxiv icon