Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Han Wang

DPOAD: Differentially Private Outsourcing of Anomaly Detection through Iterative Sensitivity Learning

Jun 27, 2022
Meisam Mohammady, Han Wang, Lingyu Wang, Mengyuan Zhang, Yosr Jarraya, Suryadipta Majumdar, Makan Pourzandi, Mourad Debbabi, Yuan Hong

Figure 1 for DPOAD: Differentially Private Outsourcing of Anomaly Detection through Iterative Sensitivity Learning

Figure 2 for DPOAD: Differentially Private Outsourcing of Anomaly Detection through Iterative Sensitivity Learning

Figure 3 for DPOAD: Differentially Private Outsourcing of Anomaly Detection through Iterative Sensitivity Learning

Figure 4 for DPOAD: Differentially Private Outsourcing of Anomaly Detection through Iterative Sensitivity Learning

Outsourcing anomaly detection to third-parties can allow data owners to overcome resource constraints (e.g., in lightweight IoT devices), facilitate collaborative analysis (e.g., under distributed or multi-party scenarios), and benefit from lower costs and specialized expertise (e.g., of Managed Security Service Providers). Despite such benefits, a data owner may feel reluctant to outsource anomaly detection without sufficient privacy protection. To that end, most existing privacy solutions would face a novel challenge, i.e., preserving privacy usually requires the difference between data entries to be eliminated or reduced, whereas anomaly detection critically depends on that difference. Such a conflict is recently resolved under a local analysis setting with trusted analysts (where no outsourcing is involved) through moving the focus of differential privacy (DP) guarantee from "all" to only "benign" entries. In this paper, we observe that such an approach is not directly applicable to the outsourcing setting, because data owners do not know which entries are "benign" prior to outsourcing, and hence cannot selectively apply DP on data entries. Therefore, we propose a novel iterative solution for the data owner to gradually "disentangle" the anomalous entries from the benign ones such that the third-party analyst can produce accurate anomaly results with sufficient DP guarantee. We design and implement our Differentially Private Outsourcing of Anomaly Detection (DPOAD) framework, and demonstrate its benefits over baseline Laplace and PainFree mechanisms through experiments with real data from different application domains.

Via

Access Paper or Ask Questions

DeePKS+ABACUS as a Bridge between Expensive Quantum Mechanical Models and Machine Learning Potentials

Jun 21, 2022
Wenfei Li, Qi Ou, Yixiao Chen, Yu Cao, Renxi Liu, Chunyi Zhang, Daye Zheng, Chun Cai, Xifan Wu, Han Wang, Mohan Chen, Linfeng Zhang

Figure 1 for DeePKS+ABACUS as a Bridge between Expensive Quantum Mechanical Models and Machine Learning Potentials

Figure 2 for DeePKS+ABACUS as a Bridge between Expensive Quantum Mechanical Models and Machine Learning Potentials

Figure 3 for DeePKS+ABACUS as a Bridge between Expensive Quantum Mechanical Models and Machine Learning Potentials

Figure 4 for DeePKS+ABACUS as a Bridge between Expensive Quantum Mechanical Models and Machine Learning Potentials

Recently, the development of machine learning (ML) potentials has made it possible to perform large-scale and long-time molecular simulations with the accuracy of quantum mechanical (QM) models. However, for high-level QM methods, such as density functional theory (DFT) at the meta-GGA level and/or with exact exchange, quantum Monte Carlo, etc., generating a sufficient amount of data for training a ML potential has remained computationally challenging due to their high cost. In this work, we demonstrate that this issue can be largely alleviated with Deep Kohn-Sham (DeePKS), a ML-based DFT model. DeePKS employs a computationally efficient neural network-based functional model to construct a correction term added upon a cheap DFT model. Upon training, DeePKS offers closely-matched energies and forces compared with high-level QM method, but the number of training data required is orders of magnitude less than that required for training a reliable ML potential. As such, DeePKS can serve as a bridge between expensive QM models and ML potentials: one can generate a decent amount of high-accuracy QM data to train a DeePKS model, and then use the DeePKS model to label a much larger amount of configurations to train a ML potential. This scheme for periodic systems is implemented in a DFT package ABACUS, which is open-source and ready for use in various applications.

Via

Access Paper or Ask Questions

RVAE-LAMOL: Residual Variational Autoencoder to Enhance Lifelong Language Learning

May 22, 2022
Han Wang, Ruiliu Fu, Xuejun Zhang, Jun Zhou

Figure 1 for RVAE-LAMOL: Residual Variational Autoencoder to Enhance Lifelong Language Learning

Figure 2 for RVAE-LAMOL: Residual Variational Autoencoder to Enhance Lifelong Language Learning

Figure 3 for RVAE-LAMOL: Residual Variational Autoencoder to Enhance Lifelong Language Learning

Figure 4 for RVAE-LAMOL: Residual Variational Autoencoder to Enhance Lifelong Language Learning

Lifelong Language Learning (LLL) aims to train a neural network to learn a stream of NLP tasks while retaining knowledge from previous tasks. However, previous works which followed data-free constraint still suffer from catastrophic forgetting issue, where the model forgets what it just learned from previous tasks. In order to alleviate catastrophic forgetting, we propose the residual variational autoencoder (RVAE) to enhance LAMOL, a recent LLL model, by mapping different tasks into a limited unified semantic space. In this space, previous tasks are easy to be correct to their own distribution by pseudo samples. Furthermore, we propose an identity task to make the model is discriminative to recognize the sample belonging to which task. For training RVAE-LAMOL better, we propose a novel training scheme Alternate Lag Training. In the experiments, we test RVAE-LAMOL on permutations of three datasets from DecaNLP. The experimental results demonstrate that RVAE-LAMOL outperforms na\"ive LAMOL on all permutations and generates more meaningful pseudo-samples.

* This paper has been accepted for publication at IJCNN 2022 on IEEE WCCI 2022; Oral presentation

Via

Access Paper or Ask Questions

No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

May 18, 2022
Han Wang, Archit Sakhadeo, Adam White, James Bell, Vincent Liu, Xutong Zhao, Puer Liu, Tadashi Kozuno, Alona Fyshe, Martha White

Figure 1 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

Figure 2 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

Figure 3 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

Figure 4 for No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

The performance of reinforcement learning (RL) agents is sensitive to the choice of hyperparameters. In real-world settings like robotics or industrial control systems, however, testing different hyperparameter configurations directly on the environment can be financially prohibitive, dangerous, or time consuming. We propose a new approach to tune hyperparameters from offline logs of data, to fully specify the hyperparameters for an RL agent that learns online in the real world. The approach is conceptually simple: we first learn a model of the environment from the offline data, which we call a calibration model, and then simulate learning in the calibration model to identify promising hyperparameters. We identify several criteria to make this strategy effective, and develop an approach that satisfies these criteria. We empirically investigate the method in a variety of settings to identify when it is effective and when it fails.

Via

Access Paper or Ask Questions

Multi-modal Semantic SLAM for Complex Dynamic Environments

May 14, 2022
Han Wang, Jing Ying Ko, Lihua Xie

Figure 1 for Multi-modal Semantic SLAM for Complex Dynamic Environments

Figure 2 for Multi-modal Semantic SLAM for Complex Dynamic Environments

Figure 3 for Multi-modal Semantic SLAM for Complex Dynamic Environments

Figure 4 for Multi-modal Semantic SLAM for Complex Dynamic Environments

Simultaneous Localization and Mapping (SLAM) is one of the most essential techniques in many real-world robotic applications. The assumption of static environments is common in most SLAM algorithms, which however, is not the case for most applications. Recent work on semantic SLAM aims to understand the objects in an environment and distinguish dynamic information from a scene context by performing image-based segmentation. However, the segmentation results are often imperfect or incomplete, which can subsequently reduce the quality of mapping and the accuracy of localization. In this paper, we present a robust multi-modal semantic framework to solve the SLAM problem in complex and highly dynamic environments. We propose to learn a more powerful object feature representation and deploy the mechanism of looking and thinking twice to the backbone network, which leads to a better recognition result to our baseline instance segmentation model. Moreover, both geometric-only clustering and visual semantic information are combined to reduce the effect of segmentation error due to small-scale objects, occlusion and motion blur. Thorough experiments have been conducted to evaluate the performance of the proposed method. The results show that our method can precisely identify dynamic objects under recognition imperfection and motion blur. Moreover, the proposed SLAM framework is able to efficiently build a static dense map at a processing rate of more than 10 Hz, which can be implemented in many practical applications. Both training data and the proposed method is open sourced at https://github.com/wh200720041/MMS_SLAM.

Via

Access Paper or Ask Questions

DouFu: A Double Fusion Joint Learning Method For Driving Trajectory Representation

May 05, 2022
Han Wang, Zhou Huang, Xiao Zhou, Ganmin Yin, Yi Bao

Figure 1 for DouFu: A Double Fusion Joint Learning Method For Driving Trajectory Representation

Figure 2 for DouFu: A Double Fusion Joint Learning Method For Driving Trajectory Representation

Figure 3 for DouFu: A Double Fusion Joint Learning Method For Driving Trajectory Representation

Figure 4 for DouFu: A Double Fusion Joint Learning Method For Driving Trajectory Representation

Driving trajectory representation learning is of great significance for various location-based services, such as driving pattern mining and route recommendation. However, previous representation generation approaches tend to rarely address three challenges: 1) how to represent the intricate semantic intentions of mobility inexpensively; 2) complex and weak spatial-temporal dependencies due to the sparsity and heterogeneity of the trajectory data; 3) route selection preferences and their correlation to driving behavior. In this paper, we propose a novel multimodal fusion model, DouFu, for trajectory representation joint learning, which applies multimodal learning and attention fusion module to capture the internal characteristics of trajectories. We first design movement, route, and global features generated from the trajectory data and urban functional zones and then analyze them respectively with the attention encoder or feed forward network. The attention fusion module incorporates route features with movement features to create a better spatial-temporal embedding. With the global semantic feature, DouFu produces a comprehensive embedding for each trajectory. We evaluate representations generated by our method and other baseline models on classification and clustering tasks. Empirical results show that DouFu outperforms other models in most of the learning algorithms like the linear regression and the support vector machine by more than 10%.

* 11 pages, 7 figures

Via

Access Paper or Ask Questions

Configuration-Aware Safe Control for Mobile Robotic Arm with Control Barrier Functions

Apr 18, 2022
Fan Ding, Jianping He, Yi Ren, Han Wang, Yu Zheng

Figure 1 for Configuration-Aware Safe Control for Mobile Robotic Arm with Control Barrier Functions

Figure 2 for Configuration-Aware Safe Control for Mobile Robotic Arm with Control Barrier Functions

Figure 3 for Configuration-Aware Safe Control for Mobile Robotic Arm with Control Barrier Functions

Figure 4 for Configuration-Aware Safe Control for Mobile Robotic Arm with Control Barrier Functions

Collision avoidance is a widely investigated topic in robotic applications. When applying collision avoidance techniques to a mobile robot, how to deal with the spatial structure of the robot still remains a challenge. In this paper, we design a configuration-aware safe control law by solving a Quadratic Programming (QP) with designed Control Barrier Functions (CBFs) constraints, which can safely navigate a mobile robotic arm to a desired region while avoiding collision with environmental obstacles. The advantage of our approach is that it correctly and in an elegant way incorporates the spatial structure of the mobile robotic arm. This is achieved by merging geometric restrictions among mobile robotic arm links into CBFs constraints. Simulations on a rigid rod and the modeled mobile robotic arm are performed to verify the feasibility and time-efficiency of proposed method. Numerical results about the time consuming for different degrees of freedom illustrate that our method scales well with dimension.

* submitted to Conference of Decision and Control(CDC)

Via

Access Paper or Ask Questions

Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification

Apr 14, 2022
Han Wang, Canwen Xu, Julian McAuley

Figure 1 for Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification

Figure 2 for Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification

Figure 3 for Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification

Figure 4 for Automatic Multi-Label Prompting: Simple and Interpretable Few-Shot Classification

Prompt-based learning (i.e., prompting) is an emerging paradigm for exploiting knowledge learned by a pretrained language model. In this paper, we propose Automatic Multi-Label Prompting (AMuLaP), a simple yet effective method to automatically select label mappings for few-shot text classification with prompting. Our method exploits one-to-many label mappings and a statistics-based algorithm to select label mappings given a prompt template. Our experiments demonstrate that AMuLaP achieves competitive performance on the GLUE benchmark without human effort or external resources.

* NAACL 2022 (main conference)

Via

Access Paper or Ask Questions

Investigating the Properties of Neural Network Representations in Reinforcement Learning

Mar 30, 2022
Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White

Figure 1 for Investigating the Properties of Neural Network Representations in Reinforcement Learning

Figure 2 for Investigating the Properties of Neural Network Representations in Reinforcement Learning

Figure 3 for Investigating the Properties of Neural Network Representations in Reinforcement Learning

Figure 4 for Investigating the Properties of Neural Network Representations in Reinforcement Learning

In this paper we investigate the properties of representations learned by deep reinforcement learning systems. Much of the earlier work in representation learning for reinforcement learning focused on designing fixed-basis architectures to achieve properties thought to be desirable, such as orthogonality and sparsity. In contrast, the idea behind deep reinforcement learning methods is that the agent designer should not encode representational properties, but rather that the data stream should determine the properties of the representation -- good representations emerge under appropriate training schemes. In this paper we bring these two perspectives together, empirically investigating the properties of representations that support transfer in reinforcement learning. This analysis allows us to provide novel hypotheses regarding impact of auxiliary tasks in end-to-end training of non-linear reinforcement learning methods. We introduce and measure six representational properties over more than 25 thousand agent-task settings. We consider DQN agents with convolutional networks in a pixel-based navigation environment. We develop a method to better understand \emph{why} some representations work better for transfer, through a systematic approach varying task similarity and measuring and correlating representation properties with transfer performance.

Via

Access Paper or Ask Questions