Song Bai

Alibaba Group, University of Oxford

TransMix: Attend to Mix for Vision Transformers

Nov 18, 2021

Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence

Nov 18, 2021

Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge

Nov 15, 2021

Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation

Nov 15, 2021

CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis

Sep 03, 2021

PlaneTR: Structure-Guided Transformers for 3D Plane Recovery

Jul 27, 2021

End-to-end Temporal Action Detection with Transformer

Jul 14, 2021

Visual Parser: Representing Part-whole Hierarchies with Transformers

Jul 13, 2021

I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition

May 18, 2021

Location-Sensitive Visual Recognition with Cross-IOU Loss

Apr 11, 2021