Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Li Wang

Northeast Normal University

Learning Decoupling Features Through Orthogonality Regularization

Mar 31, 2022

Li Wang, Rongzhi Gu, Weiji Zhuang, Peng Gao, Yujun Wang, Yuexian Zou

Figure 1 for Learning Decoupling Features Through Orthogonality Regularization

Figure 2 for Learning Decoupling Features Through Orthogonality Regularization

Figure 3 for Learning Decoupling Features Through Orthogonality Regularization

Figure 4 for Learning Decoupling Features Through Orthogonality Regularization

Abstract:Keyword spotting (KWS) and speaker verification (SV) are two important tasks in speech applications. Research shows that the state-of-art KWS and SV models are trained independently using different datasets since they expect to learn distinctive acoustic features. However, humans can distinguish language content and the speaker identity simultaneously. Motivated by this, we believe it is important to explore a method that can effectively extract common features while decoupling task-specific features. Bearing this in mind, a two-branch deep network (KWS branch and SV branch) with the same network structure is developed and a novel decoupling feature learning method is proposed to push up the performance of KWS and SV simultaneously where speaker-invariant keyword representations and keyword-invariant speaker representations are expected respectively. Experiments are conducted on Google Speech Commands Dataset (GSCD). The results demonstrate that the orthogonality regularization helps the network to achieve SOTA EER of 1.31% and 1.87% on KWS and SV, respectively.

* Accepted at ICASSP 2022

Via

Access Paper or Ask Questions

Source-free Domain Adaptation for Multi-site and Lifespan Brain Skull Stripping

Mar 11, 2022

Yunxiang Li, Ruilong Dan, Shuai Wang, Yifan Cao, Xiangde Luo, Chenghao Tan, Gangyong Jia, Huiyu Zhou, Yaqi Wang, Li Wang

Figure 1 for Source-free Domain Adaptation for Multi-site and Lifespan Brain Skull Stripping

Figure 2 for Source-free Domain Adaptation for Multi-site and Lifespan Brain Skull Stripping

Figure 3 for Source-free Domain Adaptation for Multi-site and Lifespan Brain Skull Stripping

Figure 4 for Source-free Domain Adaptation for Multi-site and Lifespan Brain Skull Stripping

Abstract:Skull stripping is a crucial prerequisite step in the analysis of brain magnetic resonance (MR) images. Although many excellent works or tools have been proposed, they suffer from low generalization capability. For instance, the model trained on a dataset with specific imaging parameters (source domain) cannot be well applied to other datasets with different imaging parameters (target domain). Especially, for the lifespan datasets, the model trained on an adult dataset is not applicable to an infant dataset due to the large domain difference. To address this issue, numerous domain adaptation (DA) methods have been proposed to align the extracted features between the source and target domains, requiring concurrent access to the input images of both domains. Unfortunately, it is problematic to share the images due to privacy. In this paper, we design a source-free domain adaptation framework (SDAF) for multi-site and lifespan skull stripping that can accomplish domain adaptation without access to source domain images. Our method only needs to share the source labels as shape dictionaries and the weights trained on the source data, without disclosing private information from source domain subjects. To deal with the domain shift between multi-site lifespan datasets, we take advantage of the brain shape prior which is invariant to imaging parameters and ages. Experiments demonstrate that our framework can significantly outperform the state-of-the-art methods on multi-site lifespan datasets.

* 11 page

Via

Access Paper or Ask Questions

Computer-Aided Road Inspection: Systems and Algorithms

Mar 04, 2022

Rui Fan, Sicen Guo, Li Wang, Mohammud Junaid Bocus

Figure 1 for Computer-Aided Road Inspection: Systems and Algorithms

Figure 2 for Computer-Aided Road Inspection: Systems and Algorithms

Figure 3 for Computer-Aided Road Inspection: Systems and Algorithms

Figure 4 for Computer-Aided Road Inspection: Systems and Algorithms

Abstract:Road damage is an inconvenience and a safety hazard, severely affecting vehicle condition, driving comfort, and traffic safety. The traditional manual visual road inspection process is pricey, dangerous, exhausting, and cumbersome. Also, manual road inspection results are qualitative and subjective, as they depend entirely on the inspector's personal experience. Therefore, there is an ever-increasing need for automated road inspection systems. This chapter first compares the five most common road damage types. Then, 2-D/3-D road imaging systems are discussed. Finally, state-of-the-art machine vision and intelligence-based road damage detection algorithms are introduced.

Via

Access Paper or Ask Questions

A Framework for Multi-stage Bonus Allocation in meal delivery Platform

Feb 22, 2022

Zhuolin Wu, Li Wang, Fangsheng Huang, Linjun Zhou, Yu Song, Chengpeng Ye, Pengyu Nie, Hao Ren, Jinghua Hao, Renqing He(+1 more)

Figure 1 for A Framework for Multi-stage Bonus Allocation in meal delivery Platform

Figure 2 for A Framework for Multi-stage Bonus Allocation in meal delivery Platform

Figure 3 for A Framework for Multi-stage Bonus Allocation in meal delivery Platform

Figure 4 for A Framework for Multi-stage Bonus Allocation in meal delivery Platform

Abstract:Online meal delivery is undergoing explosive growth, as this service is becoming increasingly popular. A meal delivery platform aims to provide excellent and stable services for customers and restaurants. However, in reality, several hundred thousand orders are canceled per day in the Meituan meal delivery platform since they are not accepted by the crowd soucing drivers. The cancellation of the orders is incredibly detrimental to the customer's repurchase rate and the reputation of the Meituan meal delivery platform. To solve this problem, a certain amount of specific funds is provided by Meituan's business managers to encourage the crowdsourcing drivers to accept more orders. To make better use of the funds, in this work, we propose a framework to deal with the multi-stage bonus allocation problem for a meal delivery platform. The objective of this framework is to maximize the number of accepted orders within a limited bonus budget. This framework consists of a semi-black-box acceptance probability model, a Lagrangian dual-based dynamic programming algorithm, and an online allocation algorithm. The semi-black-box acceptance probability model is employed to forecast the relationship between the bonus allocated to order and its acceptance probability, the Lagrangian dual-based dynamic programming algorithm aims to calculate the empirical Lagrangian multiplier for each allocation stage offline based on the historical data set, and the online allocation algorithm uses the results attained in the offline part to calculate a proper delivery bonus for each order. To verify the effectiveness and efficiency of our framework, both offline experiments on a real-world data set and online A/B tests on the Meituan meal delivery platform are conducted. Our results show that using the proposed framework, the total order cancellations can be decreased by more than 25\% in reality.

* 9 pages; submit to KDD 2022

Via

Access Paper or Ask Questions

Seeing is Living? Rethinking the Security of Facial Liveness Verification in the Deepfake Era

Feb 22, 2022

Changjiang Li, Li Wang, Shouling Ji, Xuhong Zhang, Zhaohan Xi, Shanqing Guo, Ting Wang

Figure 1 for Seeing is Living? Rethinking the Security of Facial Liveness Verification in the Deepfake Era

Figure 2 for Seeing is Living? Rethinking the Security of Facial Liveness Verification in the Deepfake Era

Figure 3 for Seeing is Living? Rethinking the Security of Facial Liveness Verification in the Deepfake Era

Figure 4 for Seeing is Living? Rethinking the Security of Facial Liveness Verification in the Deepfake Era

Abstract:Facial Liveness Verification (FLV) is widely used for identity authentication in many security-sensitive domains and offered as Platform-as-a-Service (PaaS) by leading cloud vendors. Yet, with the rapid advances in synthetic media techniques (e.g., deepfake), the security of FLV is facing unprecedented challenges, about which little is known thus far. To bridge this gap, in this paper, we conduct the first systematic study on the security of FLV in real-world settings. Specifically, we present LiveBugger, a new deepfake-powered attack framework that enables customizable, automated security evaluation of FLV. Leveraging LiveBugger, we perform a comprehensive empirical assessment of representative FLV platforms, leading to a set of interesting findings. For instance, most FLV APIs do not use anti-deepfake detection; even for those with such defenses, their effectiveness is concerning (e.g., it may detect high-quality synthesized videos but fail to detect low-quality ones). We then conduct an in-depth analysis of the factors impacting the attack performance of LiveBugger: a) the bias (e.g., gender or race) in FLV can be exploited to select victims; b) adversarial training makes deepfake more effective to bypass FLV; c) the input quality has a varying influence on different deepfake techniques to bypass FLV. Based on these findings, we propose a customized, two-stage approach that can boost the attack success rate by up to 70%. Further, we run proof-of-concept attacks on several representative applications of FLV (i.e., the clients of FLV APIs) to illustrate the practical implications: due to the vulnerability of the APIs, many downstream applications are vulnerable to deepfake. Finally, we discuss potential countermeasures to improve the security of FLV. Our findings have been confirmed by the corresponding vendors.

* Accepted as a full paper at USENIX Security '22

Via

Access Paper or Ask Questions

Exploiting Data Sparsity in Secure Cross-Platform Social Recommendation

Feb 15, 2022

Jamie Cui, Chaochao Chen, Lingjuan Lyu, Carl Yang, Li Wang

Figure 1 for Exploiting Data Sparsity in Secure Cross-Platform Social Recommendation

Figure 2 for Exploiting Data Sparsity in Secure Cross-Platform Social Recommendation

Figure 3 for Exploiting Data Sparsity in Secure Cross-Platform Social Recommendation

Figure 4 for Exploiting Data Sparsity in Secure Cross-Platform Social Recommendation

Abstract:Social recommendation has shown promising improvements over traditional systems since it leverages social correlation data as an additional input. Most existing work assumes that all data are available to the recommendation platform. However, in practice, user-item interaction data (e.g.,rating) and user-user social data are usually generated by different platforms, and both of which contain sensitive information. Therefore, "How to perform secure and efficient social recommendation across different platforms, where the data are highly-sparse in nature" remains an important challenge. In this work, we bring secure computation techniques into social recommendation, and propose S3Rec, a sparsity-aware secure cross-platform social recommendation framework. As a result, our model can not only improve the recommendation performance of the rating platform by incorporating the sparse social data on the social platform, but also protect data privacy of both platforms. Moreover, to further improve model training efficiency, we propose two secure sparse matrix multiplication protocols based on homomorphic encryption and private information retrieval. Our experiments on two benchmark datasets demonstrate the effectiveness of S3Rec.

Via

Access Paper or Ask Questions

Differential Private Knowledge Transfer for Privacy-Preserving Cross-Domain Recommendation

Feb 10, 2022

Chaochao Chen, Huiwen Wu, Jiajie Su, Lingjuan Lyu, Xiaolin Zheng, Li Wang

Figure 1 for Differential Private Knowledge Transfer for Privacy-Preserving Cross-Domain Recommendation

Figure 2 for Differential Private Knowledge Transfer for Privacy-Preserving Cross-Domain Recommendation

Figure 3 for Differential Private Knowledge Transfer for Privacy-Preserving Cross-Domain Recommendation

Figure 4 for Differential Private Knowledge Transfer for Privacy-Preserving Cross-Domain Recommendation

Abstract:Cross Domain Recommendation (CDR) has been popularly studied to alleviate the cold-start and data sparsity problem commonly existed in recommender systems. CDR models can improve the recommendation performance of a target domain by leveraging the data of other source domains. However, most existing CDR models assume information can directly 'transfer across the bridge', ignoring the privacy issues. To solve the privacy concern in CDR, in this paper, we propose a novel two stage based privacy-preserving CDR framework (PriCDR). In the first stage, we propose two methods, i.e., Johnson-Lindenstrauss Transform (JLT) based and Sparse-awareJLT (SJLT) based, to publish the rating matrix of the source domain using differential privacy. We theoretically analyze the privacy and utility of our proposed differential privacy based rating publishing methods. In the second stage, we propose a novel heterogeneous CDR model (HeteroCDR), which uses deep auto-encoder and deep neural network to model the published source rating matrix and target rating matrix respectively. To this end, PriCDR can not only protect the data privacy of the source domain, but also alleviate the data sparsity of the source domain. We conduct experiments on two benchmark datasets and the results demonstrate the effectiveness of our proposed PriCDR and HeteroCDR.

* Accepted by TheWebConf'22 (WWW'22)

Via

Access Paper or Ask Questions

Higher Order Correlation Analysis for Multi-View Learning

Jan 28, 2022

Jiawang Nie, Li Wang, Zequn Zheng

Figure 1 for Higher Order Correlation Analysis for Multi-View Learning

Figure 2 for Higher Order Correlation Analysis for Multi-View Learning

Figure 3 for Higher Order Correlation Analysis for Multi-View Learning

Figure 4 for Higher Order Correlation Analysis for Multi-View Learning

Abstract:Multi-view learning is frequently used in data science. The pairwise correlation maximization is a classical approach for exploring the consensus of multiple views. Since the pairwise correlation is inherent for two views, the extensions to more views can be diversified and the intrinsic interconnections among views are generally lost. To address this issue, we propose to maximize higher order correlations. This can be formulated as a low rank approximation problem with the higher order correlation tensor of multi-view data. We use the generating polynomial method to solve the low rank approximation problem. Numerical results on real multi-view data demonstrate that this method consistently outperforms prior existing methods.

Via

Access Paper or Ask Questions

Backdoor Defense with Machine Unlearning

Jan 24, 2022

Yang Liu, Mingyuan Fan, Cen Chen, Ximeng Liu, Zhuo Ma, Li Wang, Jianfeng Ma

Figure 1 for Backdoor Defense with Machine Unlearning

Figure 2 for Backdoor Defense with Machine Unlearning

Figure 3 for Backdoor Defense with Machine Unlearning

Figure 4 for Backdoor Defense with Machine Unlearning

Abstract:Backdoor injection attack is an emerging threat to the security of neural networks, however, there still exist limited effective defense methods against the attack. In this paper, we propose BAERASE, a novel method that can erase the backdoor injected into the victim model through machine unlearning. Specifically, BAERASE mainly implements backdoor defense in two key steps. First, trigger pattern recovery is conducted to extract the trigger patterns infected by the victim model. Here, the trigger pattern recovery problem is equivalent to the one of extracting an unknown noise distribution from the victim model, which can be easily resolved by the entropy maximization based generative model. Subsequently, BAERASE leverages these recovered trigger patterns to reverse the backdoor injection procedure and induce the victim model to erase the polluted memories through a newly designed gradient ascent based machine unlearning method. Compared with the previous machine unlearning solutions, the proposed approach gets rid of the reliance on the full access to training data for retraining and shows higher effectiveness on backdoor erasing than existing fine-tuning or pruning methods. Moreover, experiments show that BAERASE can averagely lower the attack success rates of three kinds of state-of-the-art backdoor attacks by 99\% on four benchmark datasets.

Via

Access Paper or Ask Questions

The Implicit Regularization of Momentum Gradient Descent with Early Stopping

Jan 14, 2022

Li Wang, Yingcong Zhou, Zhiguo Fu

Figure 1 for The Implicit Regularization of Momentum Gradient Descent with Early Stopping

Figure 2 for The Implicit Regularization of Momentum Gradient Descent with Early Stopping

Abstract:The study on the implicit regularization induced by gradient-based optimization is a longstanding pursuit. In the present paper, we characterize the implicit regularization of momentum gradient descent (MGD) with early stopping by comparing with the explicit $\ell_2$-regularization (ridge). In details, we study MGD in the continuous-time view, so-called momentum gradient flow (MGF), and show that its tendency is closer to ridge than the gradient descent (GD) [Ali et al., 2019] for least squares regression. Moreover, we prove that, under the calibration $t=\sqrt{2/\lambda}$, where $t$ is the time parameter in MGF and $\lambda$ is the tuning parameter in ridge regression, the risk of MGF is no more than 1.54 times that of ridge. In particular, the relative Bayes risk of MGF to ridge is between 1 and 1.035 under the optimal tuning. The numerical experiments support our theoretical results strongly.

* 7 pages, 2 figures

Via

Access Paper or Ask Questions