Title: Hard-Attention for Scalable Image Classification

Feb 20, 2021

Athanasios Papadopoulos, Paweł Korus, Nasir Memon

Abstract: Deep neural networks (DNNs) are typically optimized for a specific input resolution (e.g., $224 \times 224$ px), and their adaptation to inputs of higher resolution (e.g., satellite or medical images) remains challenging, as it leads to excessive computation and memory overhead and may require substantial engineering effort (e.g., streaming). We show that multi-scale hard-attention can be an effective solution to this problem. We propose a novel architecture, TNet, which traverses an image pyramid in a top-down fashion, visiting only the most informative regions along the way. We compare our model against strong hard-attention baselines, achieving a better trade-off between resources and accuracy on ImageNet. We further verify the efficacy of our model on satellite images (fMoW dataset) of size up to $896 \times 896$ px. In addition, our hard-attention mechanism guarantees predictions with a degree of interpretability, at no extra cost beyond inference. We also show that we can reduce data acquisition and annotation cost, since our model attends to only a fraction of the highest-resolution content and uses only image-level labels, without bounding boxes.
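To make the top-down traversal in the abstract concrete, here is a minimal NumPy sketch. It is not the authors' TNet implementation. The function names (`downsample`, `traverse`), the quadrant split, the fixed `base` view size, and above all the use of pixel variance as a stand-in for "informativeness" are assumptions made for illustration only; in the paper the attention is learned, and the sketch omits feature extraction and classification entirely. It also assumes a square image whose side is `base` times a power of two.

```python
import numpy as np


def downsample(img, factor):
    """Average-pool a 2D image by an integer factor (sides must be divisible by it)."""
    h, w = img.shape
    return img.reshape(h // factor, factor, w // factor, factor).mean(axis=(1, 3))


def traverse(img, base=32, top_k=1, score_fn=np.var):
    """Visit an image pyramid top-down, zooming only into the top_k quadrants.

    score_fn is a hypothetical placeholder for a learned attention module.
    Returns the visited regions as (level, row, col, size) tuples in
    full-resolution coordinates. Each visited region is viewed at base x base.
    """
    visited = []
    frontier = [(0, 0, img.shape[0])]  # (row, col, size) of regions at this level
    level = 0
    while frontier:
        next_frontier = []
        for r, c, size in frontier:
            visited.append((level, r, c, size))
            if size <= base:
                continue  # already at native resolution, nothing left to zoom into
            # Coarse base x base view of the current region.
            view = downsample(img[r:r + size, c:c + size], size // base)
            half, hb = size // 2, base // 2
            # Candidate quadrants in full-resolution coordinates ...
            quads = [(r + dr, c + dc, half) for dr in (0, half) for dc in (0, half)]
            # ... scored on the coarse view only, in the same order.
            scores = [score_fn(view[i:i + hb, j:j + hb])
                      for i in (0, hb) for j in (0, hb)]
            # Hard attention: keep only the top_k highest-scoring quadrants.
            keep = np.argsort(scores)[::-1][:top_k]
            next_frontier.extend(quads[k] for k in keep)
        frontier = next_frontier
        level += 1
    return visited


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    image = np.zeros((128, 128))
    image[96:, 96:] = rng.random((32, 32))  # only one corner carries any detail
    for region in traverse(image, base=32, top_k=1):
        print(region)
```

In this toy run the traversal looks at three 32 × 32 views (the whole image at low resolution, then one quadrant, then one sub-quadrant) instead of all 128 × 128 full-resolution pixels, which is the kind of resource saving the abstract refers to. The visited regions also double as a record of where the model looked, which is the sense in which hard attention gives some interpretability at no extra cost.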

View paper on

OpenReview
