Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

William Hsu

Feature Selection for Learning to Predict Outcomes of Compute Cluster Jobs with Application to Decision Support

Dec 14, 2020

Adedolapo Okanlawon, Huichen Yang, Avishek Bose, William Hsu, Dan Andresen, Mohammed Tanash

Figure 1 for Feature Selection for Learning to Predict Outcomes of Compute Cluster Jobs with Application to Decision Support

Figure 2 for Feature Selection for Learning to Predict Outcomes of Compute Cluster Jobs with Application to Decision Support

Figure 3 for Feature Selection for Learning to Predict Outcomes of Compute Cluster Jobs with Application to Decision Support

Figure 4 for Feature Selection for Learning to Predict Outcomes of Compute Cluster Jobs with Application to Decision Support

Abstract:We present a machine learning framework and a new test bed for data mining from the Slurm Workload Manager for high-performance computing (HPC) clusters. The focus was to find a method for selecting features to support decisions: helping users decide whether to resubmit failed jobs with boosted CPU and memory allocations or migrate them to a computing cloud. This task was cast as both supervised classification and regression learning, specifically, sequential problem solving suitable for reinforcement learning. Selecting relevant features can improve training accuracy, reduce training time, and produce a more comprehensible model, with an intelligent system that can explain predictions and inferences. We present a supervised learning model trained on a Simple Linux Utility for Resource Management (Slurm) data set of HPC jobs using three different techniques for selecting features: linear regression, lasso, and ridge regression. Our data set represented both HPC jobs that failed and those that succeeded, so our model was reliable, less likely to overfit, and generalizable. Our model achieved an R^2 of 95\% with 99\% accuracy. We identified five predictors for both CPU and memory properties.

* 6 pages, Proceedings of the International Conference on Computational Science and Computational Intelligence Symposium on Parallel & Distributed Computing (CSCI-ISPD 2020)

Via

Access Paper or Ask Questions

Shape-aware Generative Adversarial Networks for Attribute Transfer

Oct 11, 2020

Lei Luo, William Hsu, Shangxian Wang

Figure 1 for Shape-aware Generative Adversarial Networks for Attribute Transfer

Figure 2 for Shape-aware Generative Adversarial Networks for Attribute Transfer

Figure 3 for Shape-aware Generative Adversarial Networks for Attribute Transfer

Figure 4 for Shape-aware Generative Adversarial Networks for Attribute Transfer

Abstract:Generative adversarial networks (GANs) have been successfully applied to transfer visual attributes in many domains, including that of human face images. This success is partly attributable to the facts that human faces have similar shapes and the positions of eyes, noses, and mouths are fixed among different people. Attribute transfer is more challenging when the source and target domain share different shapes. In this paper, we introduce a shape-aware GAN model that is able to preserve shape when transferring attributes, and propose its application to some real-world domains. Compared to other state-of-art GANs-based image-to-image translation models, the model we propose is able to generate more visually appealing results while maintaining the quality of results from transfer learning.

Via

Access Paper or Ask Questions

Using a Generative Adversarial Network for CT Normalization and its Impact on Radiomic Features

Jan 22, 2020

Leihao Wei, Yannan Lin, William Hsu

Figure 1 for Using a Generative Adversarial Network for CT Normalization and its Impact on Radiomic Features

Figure 2 for Using a Generative Adversarial Network for CT Normalization and its Impact on Radiomic Features

Figure 3 for Using a Generative Adversarial Network for CT Normalization and its Impact on Radiomic Features

Figure 4 for Using a Generative Adversarial Network for CT Normalization and its Impact on Radiomic Features

Abstract:Computer-Aided-Diagnosis (CADx) systems assist radiologists with identifying and classifying potentially malignant pulmonary nodules on chest CT scans using morphology and texture-based (radiomic) features. However, radiomic features are sensitive to differences in acquisitions due to variations in dose levels and slice thickness. This study investigates the feasibility of generating a normalized scan from heterogeneous CT scans as input. We obtained projection data from 40 low-dose chest CT scans, simulating acquisitions at 10%, 25% and 50% dose and reconstructing the scans at 1.0mm and 2.0mm slice thickness. A 3D generative adversarial network (GAN) was used to simultaneously normalize reduced dose, thick slice (2.0mm) images to normal dose (100%), thinner slice (1.0mm) images. We evaluated the normalized image quality using peak signal-to-noise ratio (PSNR), structural similarity index (SSIM) and Learned Perceptual Image Patch Similarity (LPIPS). Our GAN improved perceptual similarity by 35%, compared to a baseline CNN method. Our analysis also shows that the GAN-based approach led to a significantly smaller error (p-value < 0.05) in nine studied radiomic features. These results indicated that GANs could be used to normalize heterogeneous CT images and reduce the variability in radiomic feature values.

* ISBI 2020

Via

Access Paper or Ask Questions

Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies

Jun 03, 2019

Vahid Behzadan, William Hsu

Figure 1 for Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies

Figure 2 for Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies

Figure 3 for Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies

Figure 4 for Sequential Triggers for Watermarking of Deep Reinforcement Learning Policies

Abstract:This paper proposes a novel scheme for the watermarking of Deep Reinforcement Learning (DRL) policies. This scheme provides a mechanism for the integration of a unique identifier within the policy in the form of its response to a designated sequence of state transitions, while incurring minimal impact on the nominal performance of the policy. The applications of this watermarking scheme include detection of unauthorized replications of proprietary policies, as well as enabling the graceful interruption or termination of DRL activities by authorized entities. We demonstrate the feasibility of our proposal via experimental evaluation of watermarking a DQN policy trained in the Cartpole environment.

Via

Access Paper or Ask Questions

Adversarial Exploitation of Policy Imitation

Jun 03, 2019

Vahid Behzadan, William Hsu

Figure 1 for Adversarial Exploitation of Policy Imitation

Figure 2 for Adversarial Exploitation of Policy Imitation

Figure 3 for Adversarial Exploitation of Policy Imitation

Figure 4 for Adversarial Exploitation of Policy Imitation

Abstract:This paper investigates a class of attacks targeting the confidentiality aspect of security in Deep Reinforcement Learning (DRL) policies. Recent research have established the vulnerability of supervised machine learning models (e.g., classifiers) to model extraction attacks. Such attacks leverage the loosely-restricted ability of the attacker to iteratively query the model for labels, thereby allowing for the forging of a labeled dataset which can be used to train a replica of the original model. In this work, we demonstrate the feasibility of exploiting imitation learning techniques in launching model extraction attacks on DRL agents. Furthermore, we develop proof-of-concept attacks that leverage such techniques for black-box attacks against the integrity of DRL policies. We also present a discussion on potential solution concepts for mitigation techniques.

Via

Access Paper or Ask Questions

Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)

Jun 03, 2019

Vahid Behzadan, William Hsu

Figure 1 for Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)

Figure 2 for Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)

Figure 3 for Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)

Figure 4 for Analysis and Improvement of Adversarial Training in DQN Agents With Adversarially-Guided Exploration (AGE)

Abstract:This paper investigates the effectiveness of adversarial training in enhancing the robustness of Deep Q-Network (DQN) policies to state-space perturbations. We first present a formal analysis of adversarial training in DQN agents and its performance with respect to the proportion of adversarial perturbations to nominal observations used for training. Next, we consider the sample-inefficiency of current adversarial training techniques, and propose a novel Adversarially-Guided Exploration (AGE) mechanism based on a modified hybrid of the $\epsilon$-greedy algorithm and Boltzmann exploration. We verify the feasibility of this exploration mechanism through experimental evaluation of its performance in comparison with the traditional decaying $\epsilon$-greedy and parameter-space noise exploration algorithms.

Via

Access Paper or Ask Questions

RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies

Jun 03, 2019

Vahid Behzadan, William Hsu

Figure 1 for RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies

Figure 2 for RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies

Figure 3 for RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies

Figure 4 for RL-Based Method for Benchmarking the Adversarial Resilience and Robustness of Deep Reinforcement Learning Policies

Abstract:This paper investigates the resilience and robustness of Deep Reinforcement Learning (DRL) policies to adversarial perturbations in the state space. We first present an approach for the disentanglement of vulnerabilities caused by representation learning of DRL agents from those that stem from the sensitivity of the DRL policies to distributional shifts in state transitions. Building on this approach, we propose two RL-based techniques for quantitative benchmarking of adversarial resilience and robustness in DRL policies against perturbations of state transitions. We demonstrate the feasibility of our proposals through experimental evaluation of resilience and robustness in DQN, A2C, and PPO2 policies trained in the Cartpole environment.

Via

Access Paper or Ask Questions

OpBerg: Discovering causal sentences using optimal alignments

Apr 03, 2019

Justin Wood, Nicholas J. Matiasz, Alcino J. Silva, William Hsu, Alexej Abyzov, Wei Wang

Abstract:The biological literature is rich with sentences that describe causal relations. Methods that automatically extract such sentences can help biologists to synthesize the literature and even discover latent relations that had not been articulated explicitly. Current methods for extracting causal sentences are based on either machine learning or a predefined database of causal terms. Machine learning approaches require a large set of labeled training data and can be susceptible to noise. Methods based on predefined databases are limited by the quality of their curation and are unable to capture new concepts or mistakes in the input. We address these challenges by adapting and improving a method designed for a seemingly unrelated problem: finding alignments between genomic sequences. This paper presents a novel and outperforming method for extracting causal relations from text by aligning the part-of-speech representations of an input set with that of known causal sentences. Our experiments show that when applied to the task of finding causal sentences in biological literature, our method improves on the accuracy of other methods in a computationally efficient manner.

Via

Access Paper or Ask Questions

Computed Tomography Image Enhancement using 3D Convolutional Neural Network

Jul 18, 2018

Meng Li, Shiwen Shen, Wen Gao, William Hsu, Jason Cong

Figure 1 for Computed Tomography Image Enhancement using 3D Convolutional Neural Network

Figure 2 for Computed Tomography Image Enhancement using 3D Convolutional Neural Network

Figure 3 for Computed Tomography Image Enhancement using 3D Convolutional Neural Network

Figure 4 for Computed Tomography Image Enhancement using 3D Convolutional Neural Network

Abstract:Computed tomography (CT) is increasingly being used for cancer screening, such as early detection of lung cancer. However, CT studies have varying pixel spacing due to differences in acquisition parameters. Thick slice CTs have lower resolution, hindering tasks such as nodule characterization during computer-aided detection due to partial volume effect. In this study, we propose a novel 3D enhancement convolutional neural network (3DECNN) to improve the spatial resolution of CT studies that were acquired using lower resolution/slice thicknesses to higher resolutions. Using a subset of the LIDC dataset consisting of 20,672 CT slices from 100 scans, we simulated lower resolution/thick section scans then attempted to reconstruct the original images using our 3DECNN network. A significant improvement in PSNR (29.3087dB vs. 28.8769dB, p-value < 2.2e-16) and SSIM (0.8529dB vs. 0.8449dB, p-value < 2.2e-16) compared to other state-of-art deep learning methods is observed.

Via

Access Paper or Ask Questions

Machine Learning for Predictive Analytics of Compute Cluster Jobs

May 20, 2018

Dan Andresen, William Hsu, Huichen Yang, Adedolapo Okanlawon

Figure 1 for Machine Learning for Predictive Analytics of Compute Cluster Jobs

Figure 2 for Machine Learning for Predictive Analytics of Compute Cluster Jobs

Figure 3 for Machine Learning for Predictive Analytics of Compute Cluster Jobs

Figure 4 for Machine Learning for Predictive Analytics of Compute Cluster Jobs

Abstract:We address the problem of predicting whether sufficient memory and CPU resources have been requested for jobs at submission time. For this purpose, we examine the task of training a supervised machine learning system to predict the outcome - whether the job will fail specifically due to insufficient resources - as a classification task. Sufficiently high accuracy, precision, and recall at this task facilitates more anticipatory decision support applications in the domain of HPC resource allocation. Our preliminary results using a new test bed show that the probability of failed jobs is associated with information freely available at job submission time and may thus be usable by a learning system for user modeling that gives personalized feedback to users.

* 7 pages, CSC'18 - The 16th Int'l Conf on Scientific Computing

Via

Access Paper or Ask Questions