Picture for Song Bai

Song Bai

Alibaba Group, University of Oxford

CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis

Add code
Sep 03, 2021
Figure 1 for CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis
Figure 2 for CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis
Figure 3 for CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis
Figure 4 for CAP-Net: Correspondence-Aware Point-view Fusion Network for 3D Shape Analysis
Viaarxiv icon

PlaneTR: Structure-Guided Transformers for 3D Plane Recovery

Add code
Jul 27, 2021
Figure 1 for PlaneTR: Structure-Guided Transformers for 3D Plane Recovery
Figure 2 for PlaneTR: Structure-Guided Transformers for 3D Plane Recovery
Figure 3 for PlaneTR: Structure-Guided Transformers for 3D Plane Recovery
Figure 4 for PlaneTR: Structure-Guided Transformers for 3D Plane Recovery
Viaarxiv icon

End-to-end Temporal Action Detection with Transformer

Add code
Jul 14, 2021
Figure 1 for End-to-end Temporal Action Detection with Transformer
Figure 2 for End-to-end Temporal Action Detection with Transformer
Figure 3 for End-to-end Temporal Action Detection with Transformer
Figure 4 for End-to-end Temporal Action Detection with Transformer
Viaarxiv icon

Visual Parser: Representing Part-whole Hierarchies with Transformers

Add code
Jul 13, 2021
Figure 1 for Visual Parser: Representing Part-whole Hierarchies with Transformers
Figure 2 for Visual Parser: Representing Part-whole Hierarchies with Transformers
Figure 3 for Visual Parser: Representing Part-whole Hierarchies with Transformers
Figure 4 for Visual Parser: Representing Part-whole Hierarchies with Transformers
Viaarxiv icon

I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition

Add code
May 18, 2021
Figure 1 for I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition
Figure 2 for I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition
Figure 3 for I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition
Figure 4 for I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition
Viaarxiv icon

Location-Sensitive Visual Recognition with Cross-IOU Loss

Add code
Apr 11, 2021
Figure 1 for Location-Sensitive Visual Recognition with Cross-IOU Loss
Figure 2 for Location-Sensitive Visual Recognition with Cross-IOU Loss
Figure 3 for Location-Sensitive Visual Recognition with Cross-IOU Loss
Figure 4 for Location-Sensitive Visual Recognition with Cross-IOU Loss
Viaarxiv icon

Anchor-Free Person Search

Add code
Mar 22, 2021
Figure 1 for Anchor-Free Person Search
Figure 2 for Anchor-Free Person Search
Figure 3 for Anchor-Free Person Search
Figure 4 for Anchor-Free Person Search
Viaarxiv icon

SwiftNet: Real-time Video Object Segmentation

Add code
Feb 09, 2021
Figure 1 for SwiftNet: Real-time Video Object Segmentation
Figure 2 for SwiftNet: Real-time Video Object Segmentation
Figure 3 for SwiftNet: Real-time Video Object Segmentation
Figure 4 for SwiftNet: Real-time Video Object Segmentation
Viaarxiv icon

Occluded Video Instance Segmentation

Add code
Feb 08, 2021
Figure 1 for Occluded Video Instance Segmentation
Figure 2 for Occluded Video Instance Segmentation
Figure 3 for Occluded Video Instance Segmentation
Figure 4 for Occluded Video Instance Segmentation
Viaarxiv icon

Multi-shot Temporal Event Localization: a Benchmark

Add code
Dec 17, 2020
Figure 1 for Multi-shot Temporal Event Localization: a Benchmark
Figure 2 for Multi-shot Temporal Event Localization: a Benchmark
Figure 3 for Multi-shot Temporal Event Localization: a Benchmark
Figure 4 for Multi-shot Temporal Event Localization: a Benchmark
Viaarxiv icon