Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Topic:photo style transfer

GLStyleNet: Higher Quality Style Transfer Combining Global and Local Pyramid Features

Nov 18, 2018

Zhizhong Wang, Lei Zhao, Wei Xing, Dongming Lu

Figure 1 for GLStyleNet: Higher Quality Style Transfer Combining Global and Local Pyramid Features

Figure 2 for GLStyleNet: Higher Quality Style Transfer Combining Global and Local Pyramid Features

Figure 3 for GLStyleNet: Higher Quality Style Transfer Combining Global and Local Pyramid Features

Figure 4 for GLStyleNet: Higher Quality Style Transfer Combining Global and Local Pyramid Features

Abstract:Recent studies using deep neural networks have shown remarkable success in style transfer especially for artistic and photo-realistic images. However, the approaches using global feature correlations fail to capture small, intricate textures and maintain correct texture scales of the artworks, and the approaches based on local patches are defective on global effect. In this paper, we present a novel feature pyramid fusion neural network, dubbed GLStyleNet, which sufficiently takes into consideration multi-scale and multi-level pyramid features by best aggregating layers across a VGG network, and performs style transfer hierarchically with multiple losses of different scales. Our proposed method retains high-frequency pixel information and low frequency construct information of images from two aspects: loss function constraint and feature fusion. Our approach is not only flexible to adjust the trade-off between content and style, but also controllable between global and local. Compared to state-of-the-art methods, our method can transfer not just large-scale, obvious style cues but also subtle, exquisite ones, and dramatically improves the quality of style transfer. We demonstrate the effectiveness of our approach on portrait style transfer, artistic style transfer, photo-realistic style transfer and Chinese ancient painting style transfer tasks. Experimental results indicate that our unified approach improves image style transfer quality over previous state-of-the-art methods, while also accelerating the whole process in a certain extent. Our code is available at https://github.com/EndyWon/GLStyleNet.

Via

Access Paper or Ask Questions

One-Shot Mutual Affine-Transfer for Photorealistic Stylization

Jul 24, 2019

Ying Qu, Zhenzhou Shao, Hairong Qi

Figure 1 for One-Shot Mutual Affine-Transfer for Photorealistic Stylization

Figure 2 for One-Shot Mutual Affine-Transfer for Photorealistic Stylization

Figure 3 for One-Shot Mutual Affine-Transfer for Photorealistic Stylization

Figure 4 for One-Shot Mutual Affine-Transfer for Photorealistic Stylization

Abstract:Photorealistic style transfer aims to transfer the style of a reference photo onto a content photo naturally, such that the stylized image looks like a real photo taken by a camera. Existing state-of-the-art methods are prone to spatial structure distortion of the content image and global color inconsistency across different semantic objects, making the results less photorealistic. In this paper, we propose a one-shot mutual Dirichlet network, to address these challenging issues. The essential contribution of the work is the realization of a representation scheme that successfully decouples the spatial structure and color information of images, such that the spatial structure can be well preserved during stylization. This representation is discriminative and context-sensitive with respect to semantic objects. It is extracted with a shared sparse Dirichlet encoder. Moreover, such representation is encouraged to be matched between the content and style images for faithful color transfer. The affine-transfer model is embedded in the decoder of the network to facilitate the color transfer. The strong representative and discriminative power of the proposed network enables one-shot learning given only one content-style image pair. Experimental results demonstrate that the proposed method is able to generate photorealistic photos without spatial distortion or abrupt color changes.

Via

Access Paper or Ask Questions

Style Transfer With Adaptation to the Central Objects of the Scene

Jun 04, 2019

Alexey Schekalev, Victor Kitov

Figure 1 for Style Transfer With Adaptation to the Central Objects of the Scene

Figure 2 for Style Transfer With Adaptation to the Central Objects of the Scene

Figure 3 for Style Transfer With Adaptation to the Central Objects of the Scene

Figure 4 for Style Transfer With Adaptation to the Central Objects of the Scene

Abstract:Style transfer is a problem of rendering image with some content in the style of another image, for example a family photo in the style of a painting of some famous artist. The drawback of classical style transfer algorithm is that it imposes style uniformly on all parts of the content image, which perturbs central objects on the content image, such as faces or text, and makes them unrecognizable. This work proposes a novel style transfer algorithm which automatically detects central objects on the content image, generates spatial importance mask and imposes style non-uniformly: central objects are stylized less to preserve their recognizability and other parts of the image are stylized as usual to preserve the style. Three methods of automatic central object detection are proposed and evaluated qualitatively and via a user evaluation study. Both comparisons demonstrate higher quality of stylization compared to the classical style transfer method.

Via

Access Paper or Ask Questions

Streetscape augmentation using generative adversarial networks: insights related to health and wellbeing

May 14, 2019

Jasper S. Wijnands, Kerry A. Nice, Jason Thompson, Haifeng Zhao, Mark Stevenson

Figure 1 for Streetscape augmentation using generative adversarial networks: insights related to health and wellbeing

Figure 2 for Streetscape augmentation using generative adversarial networks: insights related to health and wellbeing

Figure 3 for Streetscape augmentation using generative adversarial networks: insights related to health and wellbeing

Figure 4 for Streetscape augmentation using generative adversarial networks: insights related to health and wellbeing

Abstract:Deep learning using neural networks has provided advances in image style transfer, merging the content of one image (e.g., a photo) with the style of another (e.g., a painting). Our research shows this concept can be extended to analyse the design of streetscapes in relation to health and wellbeing outcomes. An Australian population health survey (n=34,000) was used to identify the spatial distribution of health and wellbeing outcomes, including general health and social capital. For each outcome, the most and least desirable locations formed two domains. Streetscape design was sampled using around 80,000 Google Street View images per domain. Generative adversarial networks translated these images from one domain to the other, preserving the main structure of the input image, but transforming the `style' from locations where self-reported health was bad to locations where it was good. These translations indicate that areas in Melbourne with good general health are characterised by sufficient green space and compactness of the urban environment, whilst streetscape imagery related to high social capital contained more and wider footpaths, fewer fences and more grass. Beyond identifying relationships, the method is a first step towards computer-generated design interventions that have the potential to improve population health and wellbeing.

* 20 pages, 8 figures. Preprint accepted for publication in Sustainable Cities and Society

Via

Access Paper or Ask Questions

Learning Linear Transformations for Fast Arbitrary Style Transfer

Aug 14, 2018

Xueting Li, Sifei Liu, Jan Kautz, Ming-Hsuan Yang

Figure 1 for Learning Linear Transformations for Fast Arbitrary Style Transfer

Figure 2 for Learning Linear Transformations for Fast Arbitrary Style Transfer

Figure 3 for Learning Linear Transformations for Fast Arbitrary Style Transfer

Figure 4 for Learning Linear Transformations for Fast Arbitrary Style Transfer

Abstract:Given a random pair of images, an arbitrary style transfer method extracts the feel from the reference image to synthesize an output based on the look of the other content image. Recent arbitrary style transfer methods transfer second order statistics from reference image onto content image via a multiplication between content image features and a transformation matrix, which is computed from features with a pre-determined algorithm. These algorithms either require computationally expensive operations, or fail to model the feature covariance and produce artifacts in synthesized images. Generalized from these methods, in this work, we derive the form of transformation matrix theoretically and present an arbitrary style transfer approach that learns the transformation matrix with a feed-forward network. Our algorithm is highly efficient yet allows a flexible combination of multi-level styles while preserving content affinity during style transfer process. We demonstrate the effectiveness of our approach on four tasks: artistic style transfer, video and photo-realistic style transfer as well as domain adaptation, including comparisons with the state-of-the-art methods.

Via

Access Paper or Ask Questions

Wavelet Domain Style Transfer for an Effective Perception-distortion Tradeoff in Single Image Super-Resolution

Oct 09, 2019

Xin Deng, Ren Yang, Mai Xu, Pier Luigi Dragotti

Figure 1 for Wavelet Domain Style Transfer for an Effective Perception-distortion Tradeoff in Single Image Super-Resolution

Figure 2 for Wavelet Domain Style Transfer for an Effective Perception-distortion Tradeoff in Single Image Super-Resolution

Figure 3 for Wavelet Domain Style Transfer for an Effective Perception-distortion Tradeoff in Single Image Super-Resolution

Figure 4 for Wavelet Domain Style Transfer for an Effective Perception-distortion Tradeoff in Single Image Super-Resolution

Abstract:In single image super-resolution (SISR), given a low-resolution (LR) image, one wishes to find a high-resolution (HR) version of it which is both accurate and photo-realistic. Recently, it has been shown that there exists a fundamental tradeoff between low distortion and high perceptual quality, and the generative adversarial network (GAN) is demonstrated to approach the perception-distortion (PD) bound effectively. In this paper, we propose a novel method based on wavelet domain style transfer (WDST), which achieves a better PD tradeoff than the GAN based methods. Specifically, we propose to use 2D stationary wavelet transform (SWT) to decompose one image into low-frequency and high-frequency sub-bands. For the low-frequency sub-band, we improve its objective quality through an enhancement network. For the high-frequency sub-band, we propose to use WDST to effectively improve its perceptual quality. By feat of the perfect reconstruction property of wavelets, these sub-bands can be re-combined to obtain an image which has simultaneously high objective and perceptual quality. The numerical results on various datasets show that our method achieves the best trade-off between the distortion and perceptual quality among the existing state-of-the-art SISR methods.

Via

Access Paper or Ask Questions

WarpGAN: Automatic Caricature Generation

Nov 28, 2018

Yichun Shi, Debayan Deb, Anil K. Jain

Figure 1 for WarpGAN: Automatic Caricature Generation

Figure 2 for WarpGAN: Automatic Caricature Generation

Figure 3 for WarpGAN: Automatic Caricature Generation

Figure 4 for WarpGAN: Automatic Caricature Generation

Abstract:We propose, WarpGAN, a fully automatic network that can generate caricatures given an input face photo. Besides transferring rich texture styles, WarpGAN learns to automatically predict a set of control points that can warp the photo into a caricature, while preserving identity. We introduce an identity-preserving adversarial loss that aids the discriminator to distinguish between different subjects. Moreover, WarpGAN allows customization of the generated caricatures by controlling the exaggeration extent and the visual styles. Experimental results on a public domain dataset, WebCaricature, show that WarpGAN is capable of generating a diverse set of caricatures while preserving the identities. Five caricature experts suggest that caricatures generated by WarpGAN are visually similar to hand-drawn ones and only prominent facial features are exaggerated.

Via

Access Paper or Ask Questions

Recapture as You Want

Jun 02, 2020

Chen Gao, Si Liu, Ran He, Shuicheng Yan, Bo Li

Abstract:With the increasing prevalence and more powerful camera systems of mobile devices, people can conveniently take photos in their daily life, which naturally brings the demand for more intelligent photo post-processing techniques, especially on those portrait photos. In this paper, we present a portrait recapture method enabling users to easily edit their portrait to desired posture/view, body figure and clothing style, which are very challenging to achieve since it requires to simultaneously perform non-rigid deformation of human body, invisible body-parts reasoning and semantic-aware editing. We decompose the editing procedure into semantic-aware geometric and appearance transformation. In geometric transformation, a semantic layout map is generated that meets user demands to represent part-level spatial constraints and further guides the semantic-aware appearance transformation. In appearance transformation, we design two novel modules, Semantic-aware Attentive Transfer (SAT) and Layout Graph Reasoning (LGR), to conduct intra-part transfer and inter-part reasoning, respectively. SAT module produces each human part by paying attention to the semantically consistent regions in the source portrait. It effectively addresses the non-rigid deformation issue and well preserves the intrinsic structure/appearance with rich texture details. LGR module utilizes body skeleton knowledge to construct a layout graph that connects all relevant part features, where graph reasoning mechanism is used to propagate information among part nodes to mine their relations. In this way, LGR module infers invisible body parts and guarantees global coherence among all the parts. Extensive experiments on DeepFashion, Market-1501 and in-the-wild photos demonstrate the effectiveness and superiority of our approach. Video demo is at: \url{https://youtu.be/vTyq9HL6jgw}.

* 14 pages

Via

Access Paper or Ask Questions

Deep Video Color Propagation

Aug 09, 2018

Simone Meyer, Victor Cornillère, Abdelaziz Djelouah, Christopher Schroers, Markus Gross

Figure 1 for Deep Video Color Propagation

Figure 2 for Deep Video Color Propagation

Figure 3 for Deep Video Color Propagation

Figure 4 for Deep Video Color Propagation

Abstract:Traditional approaches for color propagation in videos rely on some form of matching between consecutive video frames. Using appearance descriptors, colors are then propagated both spatially and temporally. These methods, however, are computationally expensive and do not take advantage of semantic information of the scene. In this work we propose a deep learning framework for color propagation that combines a local strategy, to propagate colors frame-by-frame ensuring temporal stability, and a global strategy, using semantics for color propagation within a longer range. Our evaluation shows the superiority of our strategy over existing video and image color propagation methods as well as neural photo-realistic style transfer approaches.

* BMVC 2018

Via

Access Paper or Ask Questions

A Closed-form Solution to Photorealistic Image Stylization

Jul 27, 2018

Yijun Li, Ming-Yu Liu, Xueting Li, Ming-Hsuan Yang, Jan Kautz

Figure 1 for A Closed-form Solution to Photorealistic Image Stylization

Figure 2 for A Closed-form Solution to Photorealistic Image Stylization

Figure 3 for A Closed-form Solution to Photorealistic Image Stylization

Figure 4 for A Closed-form Solution to Photorealistic Image Stylization

Abstract:Photorealistic image stylization concerns transferring style of a reference photo to a content photo with the constraint that the stylized photo should remain photorealistic. While several photorealistic image stylization methods exist, they tend to generate spatially inconsistent stylizations with noticeable artifacts. In this paper, we propose a method to address these issues. The proposed method consists of a stylization step and a smoothing step. While the stylization step transfers the style of the reference photo to the content photo, the smoothing step ensures spatially consistent stylizations. Each of the steps has a closed-form solution and can be computed efficiently. We conduct extensive experimental validations. The results show that the proposed method generates photorealistic stylization outputs that are more preferred by human subjects as compared to those by the competing methods while running much faster. Source code and additional results are available at https://github.com/NVIDIA/FastPhotoStyle .

* Accepted by ECCV 2018

Via

Access Paper or Ask Questions

Topic:photo style transfer

Papers and Code