Yuan Yao

Turn the Combination Lock: Learnable Textual Backdoor Attacks via Word Substitution

Jun 11, 2021
Fanchao Qi, Yuan Yao, Sophia Xu, Zhiyuan Liu, Maosong Sun

Recent studies show that neural natural language processing (NLP) models are vulnerable to backdoor attacks. Injected with backdoors, models perform normally on benign examples but produce attacker-specified predictions when the backdoor is activated, presenting serious security threats to real-world applications. Since existing textual backdoor attacks pay little attention to the invisibility of backdoors, they can be easily detected and blocked. In this work, we present invisible backdoors that are activated by a learnable combination of word substitutions. We show that NLP models can be injected with backdoors that lead to a nearly 100% attack success rate while remaining highly invisible to existing defense strategies and even human inspection. The results raise a serious alarm about the security of NLP models, which requires further research to resolve. All the data and code of this paper are released at https://github.com/thunlp/BkdAtk-LWS.

* Accepted by the main conference of ACL-IJCNLP as a long paper. Camera-ready version

Image-to-Video Generation via 3D Facial Dynamics

May 31, 2021
Xiaoguang Tu, Yingtian Zou, Jian Zhao, Wenjie Ai, Jian Dong, Yuan Yao, Zhikang Wang, Guodong Guo, Zhifeng Li, Wei Liu, Jiashi Feng

We present a versatile model, FaceAnime, for various video generation tasks from still images. Video generation from a single face image is an interesting problem and is usually tackled by using Generative Adversarial Networks (GANs) to integrate information from the input face image and a sequence of sparse facial landmarks. However, the generated face images usually suffer from quality loss, image distortion, identity change, and expression mismatch due to the weak representation capacity of the facial landmarks. In this paper, we propose to "imagine" a face video from a single face image according to the reconstructed 3D face dynamics, aiming to generate a realistic and identity-preserving face video with precisely predicted pose and facial expression. The 3D dynamics reveal changes in facial expression and motion, and can serve as strong prior knowledge for guiding highly realistic face video generation. In particular, we explore face video prediction and exploit a well-designed 3D dynamic prediction network to predict a 3D dynamic sequence for a single face image. The 3D dynamics are then further rendered by the sparse texture mapping algorithm to recover structural details and sparse textures for generating face frames. Our model is versatile for various AR/VR and entertainment applications, such as face video retargeting and face video prediction. Experimental results demonstrate its effectiveness in generating high-fidelity, identity-preserving, and visually pleasing face video clips from a single source face image.

Visual Distant Supervision for Scene Graph Generation

Mar 29, 2021
Yuan Yao, Ao Zhang, Xu Han, Mengdi Li, Cornelius Weber, Zhiyuan Liu, Stefan Wermter, Maosong Sun

Scene graph generation aims to identify objects and their relations in images, providing structured image representations that can facilitate numerous applications in computer vision. However, scene graph models usually require supervised learning on large quantities of labeled data with intensive human annotation. In this work, we propose visual distant supervision, a novel paradigm of visual relation learning, which can train scene graph models without any human-labeled data. The intuition is that by aligning commonsense knowledge bases and images, we can automatically create large-scale labeled data to provide distant supervision for visual relation learning. To alleviate the noise in distantly labeled data, we further propose a framework that iteratively estimates the probabilistic relation labels and eliminates the noisy ones. Comprehensive experimental results show that our distantly supervised model outperforms strong weakly supervised and semi-supervised baselines. By further incorporating human-labeled data in a semi-supervised fashion, our model outperforms state-of-the-art fully supervised models by a large margin (e.g., 8.6 micro- and 7.6 macro-recall@50 improvements for predicate classification in Visual Genome evaluation). All the data and code will be available to facilitate future research.
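
The alignment idea in this abstract (pairing a commonsense knowledge base with detected objects to create labels automatically) can be illustrated with a minimal sketch. Everything named here is a hypothetical stand-in: the toy `KB` dictionary, the `distant_labels` function, and the object tuples are assumptions for illustration, not the authors' data format or released code.

```python
from itertools import permutations

# Hypothetical toy knowledge base: (subject category, object category) -> plausible relations.
KB = {
    ("person", "horse"): {"riding", "feeding"},
    ("person", "hat"): {"wearing"},
}


def distant_labels(objects, kb):
    """Assign candidate relation labels to every ordered pair of detected objects.

    objects: list of (object_id, category) tuples, e.g. from an object detector.
    Returns a dict mapping (subject_id, object_id) to a set of candidate relations.
    """
    labels = {}
    for (sid, scat), (oid, ocat) in permutations(objects, 2):
        relations = kb.get((scat, ocat))
        if relations:
            # Every relation the knowledge base allows becomes a (possibly noisy) label.
            labels[(sid, oid)] = set(relations)
    return labels
```

For a person next to a horse, this sketch proposes both "riding" and "feeding" even though at most one may hold in the image. That is the kind of label noise the abstract's iterative denoising framework is meant to reduce; the denoising step itself is not sketched here.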

* 14 pages, 6 figures

UPRec: User-Aware Pre-training for Recommender Systems

Feb 22, 2021
Chaojun Xiao, Ruobing Xie, Yuan Yao, Zhiyuan Liu, Maosong Sun, Xu Zhang, Leyu Lin

Existing sequential recommendation methods rely on large amounts of training data and usually suffer from the data sparsity problem. To tackle this, the pre-training mechanism has been widely adopted, which attempts to leverage large-scale data to perform self-supervised learning and transfer the pre-trained parameters to downstream tasks. However, previous pre-trained models for recommendation focus on leveraging universal sequence patterns from user behaviour sequences and item information, while ignoring personalized interests captured by heterogeneous user information, which has been shown to be effective for personalized recommendation. In this paper, we propose a method to enhance pre-trained models with heterogeneous user information, called User-aware Pre-training for Recommendation (UPRec). Specifically, UPRec leverages user attributes and structured social graphs to construct self-supervised objectives in the pre-training stage and proposes two user-aware pre-training tasks. Comprehensive experimental results on several real-world large-scale recommendation datasets demonstrate that UPRec can effectively integrate user information into pre-trained models and thus provide more appropriate recommendations for users.

* This paper has been submitted to IEEE TKDE

Polyimide-Based Flexible Coupled-Coils Design and Load-Shift Keying Analysis

Feb 02, 2021
Yuan Yao, Wing-Hung Ki, Chi-Ying Tsui

Wireless power transfer using inductive coupling is commonly used for medical implantable devices. The design of the secondary coil on the implantable device is important, as it affects the power transfer efficiency, the size of the implant, and the data transmission between the implant and the in-vitro controller. In this paper, we present a design of the secondary coil on a polyimide-based flexible substrate to achieve high power transfer efficiency. Load shift keying modulation is used for the data communication between the primary and secondary coils. A thorough analysis of the ideal and practical scenarios shows that a mismatched secondary LC tank affects the communication range and communication correctness. A solution to achieve robust data transmission is proposed and then verified by SPICE simulations.

* 4 pages, 8 figures

StrokeGAN: Reducing Mode Collapse in Chinese Font Generation via Stroke Encoding

Jan 11, 2021
Jinshan Zeng, Qi Chen, Yunxin Liu, Mingwen Wang, Yuan Yao

The generation of stylish Chinese fonts is an important problem involved in many applications. Most existing generation methods are based on deep generative models, particularly models based on generative adversarial networks (GANs). However, these deep generative models may suffer from the mode collapse issue, which significantly degrades the diversity and quality of generated results. In this paper, we introduce a one-bit stroke encoding to capture the key mode information of Chinese characters and then incorporate it into CycleGAN, a popular deep generative model for Chinese font generation. As a result, we propose an efficient method called StrokeGAN, mainly motivated by the observation that the stroke encoding contains a substantial amount of mode information about Chinese characters. In order to reconstruct the one-bit stroke encoding of the associated generated characters, we introduce a stroke-encoding reconstruction loss imposed on the discriminator. Equipped with such one-bit stroke encoding and stroke-encoding reconstruction loss, the mode collapse issue of CycleGAN can be significantly alleviated, with an improved preservation of strokes and diversity of generated characters. The effectiveness of StrokeGAN is demonstrated by a series of generation tasks over nine datasets with different fonts. The numerical results demonstrate that StrokeGAN generally outperforms the state-of-the-art methods in terms of content and recognition accuracies, as well as certain stroke error, and also generates more realistic characters.
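
To make the one-bit stroke encoding concrete, here is a minimal sketch in which each position of a binary vector records whether a given stroke type occurs in a character. The stroke vocabulary `STROKES`, the lookup table `CHAR_STROKES`, and the mean-squared-error form of `stroke_reconstruction_loss` are all assumptions made for illustration; the abstract does not specify the stroke set or the exact loss, so the released code should be treated as the reference.

```python
import numpy as np

# Hypothetical, heavily truncated stroke vocabulary (pinyin names of basic strokes).
STROKES = ["heng", "shu", "pie", "na", "dian"]

# Hypothetical lookup table: which stroke types appear in each character.
CHAR_STROKES = {
    "十": ["heng", "shu"],
    "人": ["pie", "na"],
    "大": ["heng", "pie", "na"],
}


def one_bit_stroke_encoding(char):
    """Return a 0/1 vector with one bit per stroke type (1 = stroke type present)."""
    present = set(CHAR_STROKES[char])
    return np.array([1.0 if s in present else 0.0 for s in STROKES])


def stroke_reconstruction_loss(predicted, target):
    """Assumed loss form: mean squared error between predicted and true encodings."""
    return float(np.mean((np.asarray(predicted) - np.asarray(target)) ** 2))
```

In a GAN setup of the kind the abstract describes, `predicted` would come from an extra output of the discriminator on a generated character and `target` from the encoding of the source character, so that generated characters are penalized for losing or adding stroke types.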

* AAAI 2021
* 10 pages, our codes and data are available at: https://github.com/JinshanZeng/StrokeGAN

On Stochastic Variance Reduced Gradient Method for Semidefinite Optimization

Jan 01, 2021
Jinshan Zeng, Yixuan Zha, Ke Ma, Yuan Yao

Low-rank stochastic semidefinite optimization has attracted rising attention due to its wide range of applications. The nonconvex reformulation based on low-rank factorization significantly improves computational efficiency but brings new challenges to the analysis. The stochastic variance reduced gradient (SVRG) method has been regarded as one of the most effective methods. SVRG in general consists of two loops, where a reference full gradient is first evaluated in the outer loop and then used to yield a variance-reduced estimate of the current gradient in the inner loop. Two options have been suggested to yield the output of the inner loop, where Option I sets the output as its last iterate, and Option II yields the output via random sampling from all the iterates in the inner loop. However, there is a significant gap between the theory and practice of SVRG when adapted to stochastic semidefinite programming (SDP). SVRG practically works better with Option I, while most existing theoretical results focus on Option II. In this paper, we fill this gap by exploiting a new semi-stochastic variant of the original SVRG with Option I adapted to semidefinite optimization. Equipped with this, we establish the global linear submanifold convergence (i.e., converging exponentially fast to a submanifold of a global minimum under the orthogonal group action) of the proposed SVRG method, given a provable initialization scheme and under certain smoothness and restricted strong convexity assumptions. Our analysis includes the effects of the mini-batch size and update frequency in the inner loop as well as two practical step size strategies, the fixed and stabilized Barzilai-Borwein step sizes. Numerical results in matrix sensing demonstrate the efficiency of the proposed SVRG method, which outperforms its Option II counterpart as well as other methods.
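
The two-loop structure and the two output options described in this abstract can be sketched as follows. This is plain SVRG for a generic finite sum, not the paper's semi-stochastic variant for semidefinite optimization; the function name `svrg`, its parameters, and the user-supplied `grad_i` callback are hypothetical names chosen for this illustration.

```python
import numpy as np


def svrg(grad_i, w0, n, step, epochs, inner_steps, option="I", seed=0):
    """Plain SVRG for min_w (1/n) * sum_i f_i(w).

    grad_i(w, i) must return the gradient of the i-th component f_i at w.
    option="I" returns the last inner iterate; option="II" samples one at random.
    """
    rng = np.random.default_rng(seed)
    w_ref = np.asarray(w0, dtype=float).copy()
    for _ in range(epochs):
        # Outer loop: evaluate the reference full gradient at the snapshot w_ref.
        full_grad = sum(grad_i(w_ref, i) for i in range(n)) / n
        w = w_ref.copy()
        iterates = []
        for _ in range(inner_steps):
            i = rng.integers(n)
            # Inner loop: variance-reduced estimate of the current gradient.
            v = grad_i(w, i) - grad_i(w_ref, i) + full_grad
            w = w - step * v
            iterates.append(w.copy())
        if option == "I":
            w_ref = iterates[-1]  # Option I: last inner iterate
        else:
            w_ref = iterates[rng.integers(inner_steps)]  # Option II: random inner iterate
    return w_ref
```

For a least-squares problem with rows `a_i` and targets `b_i`, one would pass `grad_i = lambda w, i: (A[i] @ w - b[i]) * A[i]`. The only difference between the two options is the line that selects the next snapshot, which is the point at which the abstract locates the gap between theory and practice.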

* 27 pages, 5 figures

An exact solution in Markov decision process with multiplicative rewards as a general framework

Dec 15, 2020
Yuan Yao, Xiaolin Sun

We develop an exactly solvable framework for Markov decision processes with a finite horizon and continuous state and action spaces. We first review the exact solution of conventional linear quadratic regulation with a linear transition and Gaussian noise, whose optimal policy does not depend on the Gaussian noise, which is an undesired feature in the presence of significant noise. This motivates us to investigate exact solutions that depend on noise. To do so, we generalize the reward accumulation to be a general binary commutative and associative operation. With a new multiplicative accumulation, we obtain an exact solution of the optimization assuming linear transitions with Gaussian noise, and the optimal policy is noise dependent, in contrast to the additive accumulation. Furthermore, we also show that the multiplicative scheme is a general framework that covers the additive one to arbitrary precision, which is a model-independent principle.
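
The observation that the conventional linear quadratic regulator's optimal policy ignores the noise can be checked numerically. The sketch below covers only the standard additive case, written in the usual cost-minimization form for dynamics x' = A x + B u + w with w ~ N(0, Sigma); it does not implement the paper's multiplicative framework. The function name `lqr_backward` and its argument names are hypothetical.

```python
import numpy as np


def lqr_backward(A, B, Q, R, Qf, Sigma, horizon):
    """Finite-horizon LQR via the backward Riccati recursion.

    Stage cost x'Qx + u'Ru, terminal cost x'Qf x, noise covariance Sigma.
    Returns (gains, noise_cost): the optimal controls are u_t = -gains[t] @ x_t,
    and noise_cost is the constant expected extra cost caused by the noise.
    """
    P = Qf.copy()
    noise_cost = 0.0
    gains = []
    for _ in range(horizon):
        # The noise only adds the constant trace(P_{t+1} Sigma) to the value function.
        noise_cost += float(np.trace(P @ Sigma))
        # Feedback gain K_t = (R + B'PB)^{-1} B'PA; Sigma does not appear here.
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
        gains.append(K)
    gains.reverse()  # gains[0] now belongs to the first time step
    return gains, noise_cost
```

Calling the function twice with different `Sigma` values yields identical gains and different `noise_cost` values, which is the noise independence the abstract describes as undesirable when the noise is significant.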

* 11 pages

Denoising Relation Extraction from Document-level Distant Supervision

Nov 08, 2020
Chaojun Xiao, Yuan Yao, Ruobing Xie, Xu Han, Zhiyuan Liu, Maosong Sun, Fen Lin, Leyu Lin

Distant supervision (DS) has been widely used to generate auto-labeled data for sentence-level relation extraction (RE), which improves RE performance. However, the existing success of DS cannot be directly transferred to the more challenging document-level relation extraction (DocRE), since the inherent noise in DS may be multiplied at the document level and significantly harm the performance of RE. To address this challenge, we propose a novel pre-trained model for DocRE, which denoises the document-level DS data via multiple pre-training tasks. Experimental results on the large-scale DocRE benchmark show that our model can capture useful information from noisy DS data and achieve promising results.

* EMNLP 2020 short paper
