Songtao Liu

BorderDet: Border Feature for Dense Object Detection

Jul 21, 2020
Han Qiu, Yuchen Ma, Zeming Li, Songtao Liu, Jian Sun

Dense object detectors rely on the sliding-window paradigm, which predicts objects over a regular grid of the image. The feature maps at each grid point are used to generate the bounding box predictions. The point feature is convenient to use but may lack the explicit border information needed for accurate localization. In this paper, we propose a simple and efficient operator called BorderAlign to extract "border features" from the extreme points of the border to enhance the point feature. Based on BorderAlign, we design a novel detection architecture called BorderDet, which explicitly exploits the border information for stronger classification and more accurate localization. With a ResNet-50 backbone, our method improves the single-stage detector FCOS by 2.8 AP (38.6 vs. 41.4). With the ResNeXt-101-DCN backbone, BorderDet obtains 50.3 AP, outperforming the existing state-of-the-art approaches. The code is available at https://github.com/Megvii-BaseDetection/BorderDet.

* Accepted by ECCV 2020 as Oral
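
The idea of pooling features along a box border can be illustrated with a minimal sketch. This is not the authors' BorderAlign operator: it assumes a single feature map of shape (C, H, W), a box already expressed in feature-map coordinates, nearest-neighbour sampling, and a hypothetical function name `border_max_features`. It samples a few evenly spaced points on each of the four borders and keeps the channel-wise maximum per border.

```python
import numpy as np

def border_max_features(feature, box, num_points=5):
    """Channel-wise max over points sampled on each border of a box.

    feature: array of shape (C, H, W).
    box: (x1, y1, x2, y2) in feature-map coordinates (assumption).
    Returns a dict mapping border name to a vector of shape (C,).
    """
    _, h, w = feature.shape
    x1, y1, x2, y2 = box
    t = np.linspace(0.0, 1.0, num_points)
    xs = x1 + t * (x2 - x1)  # evenly spaced x positions along horizontal borders
    ys = y1 + t * (y2 - y1)  # evenly spaced y positions along vertical borders
    borders = {
        "left": (np.full(num_points, x1), ys),
        "top": (xs, np.full(num_points, y1)),
        "right": (np.full(num_points, x2), ys),
        "bottom": (xs, np.full(num_points, y2)),
    }
    pooled = {}
    for name, (px, py) in borders.items():
        # Nearest-neighbour sampling is a simplification chosen for brevity.
        ix = np.clip(np.rint(px).astype(int), 0, w - 1)
        iy = np.clip(np.rint(py).astype(int), 0, h - 1)
        pooled[name] = feature[:, iy, ix].max(axis=1)
    return pooled
```

In a detector, the pooled border vectors would be combined with the point feature before classification and regression; how that combination is done is outside the scope of this sketch.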

Multi-Scale Positive Sample Refinement for Few-Shot Object Detection

Jul 18, 2020
Jiaxi Wu, Songtao Liu, Di Huang, Yunhong Wang

Few-shot object detection (FSOD) helps detectors adapt to unseen classes with few training instances, and is useful when manual annotation is time-consuming or data acquisition is limited. Unlike previous attempts that exploit few-shot classification techniques to facilitate FSOD, this work highlights the necessity of handling scale variation, which is challenging due to the unique sample distribution. To this end, we propose a Multi-scale Positive Sample Refinement (MPSR) approach to enrich object scales in FSOD. It generates multi-scale positive samples as object pyramids and refines the prediction at various scales. We demonstrate its advantage by integrating it as an auxiliary branch into the popular architecture of Faster R-CNN with FPN, delivering a strong FSOD solution. Experiments on PASCAL VOC and MS COCO show that the proposed approach achieves state-of-the-art results and significantly outperforms its counterparts. Code is available at https://github.com/jiaxi-wu/MPSR.

* Accepted at ECCV 2020

AutoAssign: Differentiable Label Assignment for Dense Object Detection

Jul 07, 2020
Benjin Zhu, Jianfeng Wang, Zhengkai Jiang, Fuhang Zong, Songtao Liu, Zeming Li, Jian Sun

In this paper, we propose an anchor-free object detector with a fully differentiable label assignment strategy, named AutoAssign. It automatically determines positive/negative samples by generating positive and negative weight maps to modify each location's prediction dynamically. Specifically, we present a center weighting module to adjust the category-specific prior distributions and a confidence weighting module to adapt the assignment strategy to each instance. The entire label assignment process is differentiable and requires no additional modification to transfer to different datasets and tasks. Extensive experiments on MS COCO show that our method steadily surpasses other best sampling strategies by about 1% AP with various backbones. Moreover, our best model achieves 52.1% AP, outperforming all existing one-stage detectors. In addition, experiments on other datasets, e.g., PASCAL VOC, Objects365, and WiderFace, demonstrate the broad applicability of AutoAssign.

* Rejected by ECCV 2020; Reformatted

Cross-domain Object Detection through Coarse-to-Fine Feature Adaptation

Mar 23, 2020
Yangtao Zheng, Di Huang, Songtao Liu, Yunhong Wang

Recent years have witnessed great progress in deep-learning-based object detection. However, due to the domain shift problem, applying off-the-shelf detectors to an unseen domain leads to a significant performance drop. To address this issue, this paper proposes a novel coarse-to-fine feature adaptation approach to cross-domain object detection. At the coarse-grained stage, unlike the rough image-level or instance-level feature alignment used in the literature, foreground regions are extracted with an attention mechanism and aligned according to their marginal distributions via multi-layer adversarial learning in the common feature space. At the fine-grained stage, we conduct conditional distribution alignment of foregrounds by minimizing the distance between global prototypes of the same category from different domains. Thanks to this coarse-to-fine feature adaptation, domain knowledge in foreground regions can be effectively transferred. Extensive experiments are carried out in various cross-domain detection scenarios. The results are state-of-the-art, demonstrating the broad applicability and effectiveness of the proposed approach.

Learning Spatial Fusion for Single-Shot Object Detection

Nov 25, 2019
Songtao Liu, Di Huang, Yunhong Wang

Pyramidal feature representation is the common practice for addressing scale variation in object detection. However, the inconsistency across different feature scales is a primary limitation for single-shot detectors based on feature pyramids. In this work, we propose a novel, data-driven strategy for pyramidal feature fusion, referred to as adaptively spatial feature fusion (ASFF). It learns to spatially filter conflicting information to suppress the inconsistency, thus improving the scale-invariance of features, and adds almost no inference overhead. With the ASFF strategy and a solid YOLOv3 baseline, we achieve the best speed-accuracy trade-off on the MS COCO dataset, reporting 38.1% AP at 60 FPS, 42.4% AP at 45 FPS and 43.9% AP at 29 FPS. The code is available at https://github.com/ruinmessi/ASFF
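
A per-location weighted fusion of pyramid levels can be sketched as follows. This is a minimal illustration rather than the released ASFF code: it assumes the level features have already been resized to a common shape (C, H, W), and it takes the per-level weight logits as an input, whereas a real model would learn them. The function name `spatial_fusion` is hypothetical.

```python
import numpy as np

def spatial_fusion(features, logits):
    """Fuse L feature maps with per-pixel softmax weights.

    features: list of L arrays, each of shape (C, H, W), already resized
              to the same resolution (assumption).
    logits:   array of shape (L, H, W); one weight logit per level and
              location (given here, learned in a real model).
    Returns (fused, weights) with shapes (C, H, W) and (L, H, W).
    """
    feats = np.stack(features)  # (L, C, H, W)
    # Softmax over the level axis, shifted for numerical stability.
    shifted = logits - logits.max(axis=0, keepdims=True)
    weights = np.exp(shifted)
    weights /= weights.sum(axis=0, keepdims=True)
    # Broadcast the weights over channels and sum across levels.
    fused = (weights[:, None, :, :] * feats).sum(axis=0)
    return fused, weights
```

Because the weights at each location sum to one, a level whose logit is much lower than the others contributes almost nothing there, which is one way to filter out conflicting information spatially.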

Higher-order Weighted Graph Convolutional Networks

Nov 12, 2019
Songtao Liu, Lingwei Chen, Hanze Dong, Zihao Wang, Dinghao Wu, Zengfeng Huang

Graph Convolution Network (GCN) has been recognized as one of the most effective graph models for semi-supervised learning, but it extracts only first-order or few-order neighborhood information through information propagation, and its performance drops for deeper structures. Existing approaches that deal with higher-order neighbors tend to rely on powers of the adjacency matrix. In this paper, we assume a seemingly trivial condition: that the higher-order neighborhood information may be similar to that of the first-order neighbors. Accordingly, we present an unsupervised approach to describe such similarities and learn the weight matrices of higher-order neighbors automatically through Lasso, which minimizes the feature loss between the first-order and higher-order neighbors. Based on this, we formulate a new convolutional filter for GCN to learn better node representations. Our model, called higher-order weighted GCN (HWGCN), achieves state-of-the-art results on a number of node classification tasks over the Cora, Citeseer and Pubmed datasets.

* 15 pages

Adaptive NMS: Refining Pedestrian Detection in a Crowd

Apr 07, 2019
Songtao Liu, Di Huang, Yunhong Wang

Pedestrian detection in a crowd is a very challenging issue. This paper addresses the problem with a novel Non-Maximum Suppression (NMS) algorithm that better refines the bounding boxes given by detectors. The contributions are threefold: (1) we propose adaptive-NMS, which applies a dynamic suppression threshold to an instance according to the target density; (2) we design an efficient subnetwork to learn density scores, which can be conveniently embedded into both single-stage and two-stage detectors; and (3) we achieve state-of-the-art results on the CityPersons and CrowdHuman benchmarks.

* To appear at CVPR 2019 (Oral)
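
A density-dependent suppression threshold can be sketched as a small change to greedy NMS. This is an illustrative sketch, not the authors' implementation: it assumes the per-box density scores are supplied as an input (the abstract describes a subnetwork that learns them), it uses hard suppression, and it takes the threshold for a kept box to be the larger of a base threshold and that box's density. The names `adaptive_nms` and `iou_one_to_many` are hypothetical.

```python
import numpy as np

def iou_one_to_many(box, boxes):
    """IoU between one box and an array of boxes, all as (x1, y1, x2, y2)."""
    x1 = np.maximum(box[0], boxes[:, 0])
    y1 = np.maximum(box[1], boxes[:, 1])
    x2 = np.minimum(box[2], boxes[:, 2])
    y2 = np.minimum(box[3], boxes[:, 3])
    inter = np.clip(x2 - x1, 0, None) * np.clip(y2 - y1, 0, None)
    area = (box[2] - box[0]) * (box[3] - box[1])
    areas = (boxes[:, 2] - boxes[:, 0]) * (boxes[:, 3] - boxes[:, 1])
    return inter / (area + areas - inter)

def adaptive_nms(boxes, scores, densities, base_thresh=0.5):
    """Greedy NMS whose IoU threshold rises in dense regions.

    densities: one value in [0, 1] per box (assumed given here).
    Returns the indices of kept boxes in descending score order.
    """
    order = np.argsort(-scores)
    keep = []
    while order.size > 0:
        m = order[0]
        keep.append(int(m))
        rest = order[1:]
        if rest.size == 0:
            break
        # Crowded instance: tolerate more overlap before suppressing.
        thresh = max(base_thresh, float(densities[m]))
        ious = iou_one_to_many(boxes[m], boxes[rest])
        order = rest[ious < thresh]
    return keep
```

With low densities this reduces to standard greedy NMS at `base_thresh`; with a high density on the top-scoring box, a heavily overlapping neighbour survives instead of being suppressed.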

Receptive Field Block Net for Accurate and Fast Object Detection

Jul 26, 2018
Songtao Liu, Di Huang, Yunhong Wang

Current top-performing object detectors depend on deep CNN backbones, such as ResNet-101 and Inception, benefiting from their powerful feature representations but suffering from high computational costs. Conversely, some detectors based on lightweight models achieve real-time processing, but their accuracy is often criticized. In this paper, we explore an alternative way to build a fast and accurate detector by strengthening lightweight features using a hand-crafted mechanism. Inspired by the structure of Receptive Fields (RFs) in human visual systems, we propose a novel RF Block (RFB) module, which takes the relationship between the size and eccentricity of RFs into account, to enhance feature discriminability and robustness. We further assemble RFB on top of SSD, constructing the RFB Net detector. To evaluate its effectiveness, experiments are conducted on two major benchmarks, and the results show that RFB Net is able to reach the performance of advanced very deep detectors while keeping real-time speed. Code is available at https://github.com/ruinmessi/RFBNet.

* Accepted by ECCV 2018
