Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Learning Spatial Pyramid Attentive Pooling in Image Synthesis and Image-to-Image Translation

Jan 18, 2019

Wei Sun, Tianfu Wu

Figure 1 for Learning Spatial Pyramid Attentive Pooling in Image Synthesis and Image-to-Image Translation

Figure 2 for Learning Spatial Pyramid Attentive Pooling in Image Synthesis and Image-to-Image Translation

Figure 3 for Learning Spatial Pyramid Attentive Pooling in Image Synthesis and Image-to-Image Translation

Figure 4 for Learning Spatial Pyramid Attentive Pooling in Image Synthesis and Image-to-Image Translation

Share this with someone who'll enjoy it:

Abstract:Image synthesis and image-to-image translation are two important generative learning tasks. Remarkable progress has been made by learning Generative Adversarial Networks (GANs)~\cite{goodfellow2014generative} and cycle-consistent GANs (CycleGANs)~\cite{zhu2017unpaired} respectively. This paper presents a method of learning Spatial Pyramid Attentive Pooling (SPAP) which is a novel architectural unit and can be easily integrated into both generators and discriminators in GANs and CycleGANs. The proposed SPAP integrates Atrous spatial pyramid~\cite{chen2018deeplab}, a proposed cascade attention mechanism and residual connections~\cite{he2016deep}. It leverages the advantages of the three components to facilitate effective end-to-end generative learning: (i) the capability of fusing multi-scale information by ASPP; (ii) the capability of capturing relative importance between both spatial locations (especially multi-scale context) or feature channels by attention; (iii) the capability of preserving information and enhancing optimization feasibility by residual connections. Coarse-to-fine and fine-to-coarse SPAP are studied and intriguing attention maps are observed in both tasks. In experiments, the proposed SPAP is tested in GANs on the Celeba-HQ-128 dataset~\cite{karras2017progressive}, and tested in CycleGANs on the Image-to-Image translation datasets including the Cityscape dataset~\cite{cordts2016cityscapes}, Facade and Aerial Maps dataset~\cite{zhu2017unpaired}, both obtaining better performance.

* 12 pages

View paper on

Share this with someone who'll enjoy it:

Title:Learning Spatial Pyramid Attentive Pooling in Image Synthesis and Image-to-Image Translation

Paper and Code