Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xudong Sun

Resampling-based Assessment of Robustness to Distribution Shift for Deep Neural Networks

Jun 07, 2019

Xudong Sun, Yu Wang, Alexej Gossmann, Bernd Bischl

Figure 1 for Resampling-based Assessment of Robustness to Distribution Shift for Deep Neural Networks

Figure 2 for Resampling-based Assessment of Robustness to Distribution Shift for Deep Neural Networks

Figure 3 for Resampling-based Assessment of Robustness to Distribution Shift for Deep Neural Networks

Figure 4 for Resampling-based Assessment of Robustness to Distribution Shift for Deep Neural Networks

Abstract:A novel resampling framework is proposed to evaluate the robustness and generalization capability of deep learning models with respect to distribution shift. We use Auto Encoder Variational Bayes to find a latent representation of the data, on which a Variational Gaussian Mixture Model is applied to deliberately create distribution shift by dividing the dataset into different clusters. Wasserstein distance is used to characterize the extent of distribution shift between the training and the testing data splits. We compare several conventional Convolutional Neural Network (CNN) architectures as well as Bayesian CNN models for image classification on the Fashion-MNIST dataset to assess their robustness under the deliberately created distribution shift.

Via

Access Paper or Ask Questions

Maximum Entropy-Regularized Multi-Goal Reinforcement Learning

May 28, 2019

Rui Zhao, Xudong Sun, Volker Tresp

Figure 1 for Maximum Entropy-Regularized Multi-Goal Reinforcement Learning

Figure 2 for Maximum Entropy-Regularized Multi-Goal Reinforcement Learning

Figure 3 for Maximum Entropy-Regularized Multi-Goal Reinforcement Learning

Figure 4 for Maximum Entropy-Regularized Multi-Goal Reinforcement Learning

Abstract:In Multi-Goal Reinforcement Learning, an agent learns to achieve multiple goals with a goal-conditioned policy. During learning, the agent first collects the trajectories into a replay buffer, and later these trajectories are selected randomly for replay. However, the achieved goals in the replay buffer are often biased towards the behavior policies. From a Bayesian perspective, when there is no prior knowledge about the target goal distribution, the agent should learn uniformly from diverse achieved goals. Therefore, we first propose a novel multi-goal RL objective based on weighted entropy. This objective encourages the agent to maximize the expected return, as well as to achieve more diverse goals. Secondly, we developed a maximum entropy-based prioritization framework to optimize the proposed objective. For evaluation of this framework, we combine it with Deep Deterministic Policy Gradient, both with or without Hindsight Experience Replay. On a set of multi-goal robotic tasks of OpenAI Gym, we compare our method with other baselines and show promising improvements in both performance and sample-efficiency.

* PMLR 97:7553-7562, 2019
* Published in International Conference on Machine Learning (ICML 2019), Long Beach, USA. arXiv admin note: text overlap with arXiv:1902.08039

Via

Access Paper or Ask Questions

ReinBo: Machine Learning pipeline search and configuration with Bayesian Optimization embedded Reinforcement Learning

Apr 10, 2019

Xudong Sun, Jiali Lin, Bernd Bischl

Figure 1 for ReinBo: Machine Learning pipeline search and configuration with Bayesian Optimization embedded Reinforcement Learning

Figure 2 for ReinBo: Machine Learning pipeline search and configuration with Bayesian Optimization embedded Reinforcement Learning

Figure 3 for ReinBo: Machine Learning pipeline search and configuration with Bayesian Optimization embedded Reinforcement Learning

Figure 4 for ReinBo: Machine Learning pipeline search and configuration with Bayesian Optimization embedded Reinforcement Learning

Abstract:Machine learning pipeline potentially consists of several stages of operations like data preprocessing, feature engineering and machine learning model training. Each operation has a set of hyper-parameters, which can become irrelevant for the pipeline when the operation is not selected. This gives rise to a hierarchical conditional hyper-parameter space. To optimize this mixed continuous and discrete conditional hierarchical hyper-parameter space, we propose an efficient pipeline search and configuration algorithm which combines the power of Reinforcement Learning and Bayesian Optimization. Empirical results show that our method performs favorably compared to state of the art methods like Auto-sklearn , TPOT, Tree Parzen Window, and Random Search.

Via

Access Paper or Ask Questions

High Dimensional Restrictive Federated Model Selection with multi-objective Bayesian Optimization over shifted distributions

Feb 24, 2019

Xudong Sun, Andrea Bommert, Florian Pfisterer, Jörg Rahnenführer, Michel Lang, Bernd Bischl

Figure 1 for High Dimensional Restrictive Federated Model Selection with multi-objective Bayesian Optimization over shifted distributions

Figure 2 for High Dimensional Restrictive Federated Model Selection with multi-objective Bayesian Optimization over shifted distributions

Figure 3 for High Dimensional Restrictive Federated Model Selection with multi-objective Bayesian Optimization over shifted distributions

Figure 4 for High Dimensional Restrictive Federated Model Selection with multi-objective Bayesian Optimization over shifted distributions

Abstract:A novel machine learning optimization process coined Restrictive Federated Model Selection (RFMS) is proposed under the scenario, for example, when data from healthcare units can not leave the site it is situated on and it is forbidden to carry out training algorithms on remote data sites due to either technical or privacy and trust concerns. To carry out a clinical research under this scenario, an analyst could train a machine learning model only on local data site, but it is still possible to execute a statistical query at a certain cost in the form of sending a machine learning model to some of the remote data sites and get the performance measures as feedback, maybe due to prediction being usually much cheaper. Compared to federated learning, which is optimizing the model parameters directly by carrying out training across all data sites, RFMS trains model parameters only on one local data site but optimizes hyper-parameters across other data sites jointly since hyper-parameters play an important role in machine learning performance. The aim is to get a Pareto optimal model with respective to both local and remote unseen prediction losses, which could generalize well across data sites. In this work, we specifically consider high dimensional data with shifted distributions over data sites. As an initial investigation, Bayesian Optimization especially multi-objective Bayesian Optimization is used to guide an adaptive hyper-parameter optimization process to select models under the RFMS scenario. Empirical results show that solely using the local data site to tune hyper-parameters generalizes poorly across data sites, compared to methods that utilize the local and remote performances. Furthermore, in terms of dominated hypervolumes, multi-objective Bayesian Optimization algorithms show increased performance across multiple data sites among other candidates.

Via

Access Paper or Ask Questions

Face Detection using Deep Learning: An Improved Faster RCNN Approach

Jan 28, 2017

Xudong Sun, Pengcheng Wu, Steven C. H. Hoi

Figure 1 for Face Detection using Deep Learning: An Improved Faster RCNN Approach

Figure 2 for Face Detection using Deep Learning: An Improved Faster RCNN Approach

Figure 3 for Face Detection using Deep Learning: An Improved Faster RCNN Approach

Figure 4 for Face Detection using Deep Learning: An Improved Faster RCNN Approach

Abstract:In this report, we present a new face detection scheme using deep learning and achieve the state-of-the-art detection performance on the well-known FDDB face detetion benchmark evaluation. In particular, we improve the state-of-the-art faster RCNN framework by combining a number of strategies, including feature concatenation, hard negative mining, multi-scale training, model pretraining, and proper calibration of key parameters. As a consequence, the proposed scheme obtained the state-of-the-art face detection performance, making it the best model in terms of ROC curves among all the published methods on the FDDB benchmark.

Via

Access Paper or Ask Questions