Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Per-Pixel Classification is Not All You Need for Semantic Segmentation

Jul 13, 2021

Bowen Cheng, Alexander G. Schwing, Alexander Kirillov

Figure 1 for Per-Pixel Classification is Not All You Need for Semantic Segmentation

Figure 2 for Per-Pixel Classification is Not All You Need for Semantic Segmentation

Figure 3 for Per-Pixel Classification is Not All You Need for Semantic Segmentation

Figure 4 for Per-Pixel Classification is Not All You Need for Semantic Segmentation

Share this with someone who'll enjoy it:

Abstract:Modern approaches typically formulate semantic segmentation as a per-pixel classification task, while instance-level segmentation is handled with an alternative mask classification. Our key insight: mask classification is sufficiently general to solve both semantic- and instance-level segmentation tasks in a unified manner using the exact same model, loss, and training procedure. Following this observation, we propose MaskFormer, a simple mask classification model which predicts a set of binary masks, each associated with a single global class label prediction. Overall, the proposed mask classification-based method simplifies the landscape of effective approaches to semantic and panoptic segmentation tasks and shows excellent empirical results. In particular, we observe that MaskFormer outperforms per-pixel classification baselines when the number of classes is large. Our mask classification-based method outperforms both current state-of-the-art semantic (55.6 mIoU on ADE20K) and panoptic segmentation (52.7 PQ on COCO) models.

* Project page: https://bowenc0221.github.io/maskformer

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Per-Pixel Classification is Not All You Need for Semantic Segmentation

Paper and Code