Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jiali Duan

PortraitGAN for Flexible Portrait Manipulation

Jul 05, 2018

Jiali Duan, Xiaoyuan Guo, Yuhang Song, Chao Yang, C. -C. Jay Kuo

Figure 1 for PortraitGAN for Flexible Portrait Manipulation

Figure 2 for PortraitGAN for Flexible Portrait Manipulation

Figure 3 for PortraitGAN for Flexible Portrait Manipulation

Figure 4 for PortraitGAN for Flexible Portrait Manipulation

Abstract:Previous methods have dealt with discrete manipulation of facial attributes such as smile, sad, angry, surprise etc, out of canonical expressions and they are not scalable, operating in single modality. In this paper, we propose a novel framework that supports continuous edits and multi-modality portrait manipulation using adversarial learning. Specifically, we adapt cycle-consistency into the conditional setting by leveraging additional facial landmarks information. This has two effects: first cycle mapping induces bidirectional manipulation and identity preserving; second pairing samples from different modalities can thus be utilized. To ensure high-quality synthesis, we adopt texture-loss that enforces texture consistency and multi-level adversarial supervision that facilitates gradient flow. Quantitative and qualitative experiments show the effectiveness of our framework in performing flexible and multi-modality portrait manipulation with photo-realistic effects.

Via

Access Paper or Ask Questions

Multi-Modality Fusion based on Consensus-Voting and 3D Convolution for Isolated Gesture Recognition

Nov 28, 2016

Jiali Duan, Shuai Zhou, Jun Wan, Xiaoyuan Guo, Stan Z. Li

Figure 1 for Multi-Modality Fusion based on Consensus-Voting and 3D Convolution for Isolated Gesture Recognition

Figure 2 for Multi-Modality Fusion based on Consensus-Voting and 3D Convolution for Isolated Gesture Recognition

Figure 3 for Multi-Modality Fusion based on Consensus-Voting and 3D Convolution for Isolated Gesture Recognition

Figure 4 for Multi-Modality Fusion based on Consensus-Voting and 3D Convolution for Isolated Gesture Recognition

Abstract:Recently, the popularity of depth-sensors such as Kinect has made depth videos easily available while its advantages have not been fully exploited. This paper investigates, for gesture recognition, to explore the spatial and temporal information complementarily embedded in RGB and depth sequences. We propose a convolutional twostream consensus voting network (2SCVN) which explicitly models both the short-term and long-term structure of the RGB sequences. To alleviate distractions from background, a 3d depth-saliency ConvNet stream (3DDSN) is aggregated in parallel to identify subtle motion characteristics. These two components in an unified framework significantly improve the recognition accuracy. On the challenging Chalearn IsoGD benchmark, our proposed method outperforms the first place on the leader-board by a large margin (10.29%) while also achieving the best result on RGBD-HuDaAct dataset (96.74%). Both quantitative experiments and qualitative analysis shows the effectiveness of our proposed framework and codes will be released to facilitate future research.

Via

Access Paper or Ask Questions