
Yutong Lin

V-DETR: DETR with Vertex Relative Position Encoding for 3D Object Detection

Aug 08, 2023
Yichao Shen, Zigang Geng, Yuhui Yuan, Yutong Lin, Ze Liu, Chunyu Wang, Han Hu, Nanning Zheng, Baining Guo

Via arXiv

DETR Doesn't Need Multi-Scale or Locality Design

Aug 03, 2023
Yutong Lin, Yuhui Yuan, Zheng Zhang, Chen Li, Nanning Zheng, Han Hu

Via arXiv

Could Giant Pretrained Image Models Extract Universal Representations?

Nov 03, 2022
Yutong Lin, Ze Liu, Zheng Zhang, Han Hu, Nanning Zheng, Stephen Lin, Yue Cao

Via arXiv

On Data Scaling in Masked Image Modeling

Jun 09, 2022
Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Yixuan Wei, Qi Dai, Han Hu

Via arXiv

A Simple Baseline for Zero-shot Semantic Segmentation with Pre-trained Vision-language Model

Dec 29, 2021
Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Han Hu, Xiang Bai

Via arXiv

SimMIM: A Simple Framework for Masked Image Modeling

Nov 18, 2021
Zhenda Xie, Zheng Zhang, Yue Cao, Yutong Lin, Jianmin Bao, Zhuliang Yao, Qi Dai, Han Hu

Via arXiv

Swin Transformer V2: Scaling Up Capacity and Resolution

Nov 18, 2021
Ze Liu, Han Hu, Yutong Lin, Zhuliang Yao, Zhenda Xie, Yixuan Wei, Jia Ning, Yue Cao, Zheng Zhang, Li Dong, Furu Wei, Baining Guo

Via arXiv

Bootstrap Your Object Detector via Mixed Training

Nov 04, 2021
Mengde Xu, Zheng Zhang, Fangyun Wei, Yutong Lin, Yue Cao, Stephen Lin, Han Hu, Xiang Bai

Via arXiv

Self-Supervised Learning with Swin Transformers

May 11, 2021
Zhenda Xie, Yutong Lin, Zhuliang Yao, Zheng Zhang, Qi Dai, Yue Cao, Han Hu

Via arXiv

Swin Transformer: Hierarchical Vision Transformer using Shifted Windows

Mar 25, 2021
Ze Liu, Yutong Lin, Yue Cao, Han Hu, Yixuan Wei, Zheng Zhang, Stephen Lin, Baining Guo

Via arXiv