Sicco Verwer

SAGE: Intrusion Alert-driven Attack Graph Extractor

Jul 06, 2021
Azqa Nadeem, Sicco Verwer, Stephen Moskal, Shanchieh Jay Yang

Attack graphs (AGs) are used to assess pathways availed by cyber adversaries to penetrate a network. State-of-the-art approaches for AG generation focus mostly on deriving dependencies between system vulnerabilities based on network scans and expert knowledge. In real-world operations, however, it is costly and ineffective to rely on constant vulnerability scanning and expert-crafted AGs. We propose to automatically learn AGs based on actions observed through intrusion alerts, without prior expert knowledge. Specifically, we develop an unsupervised sequence learning system, SAGE, that leverages the temporal and probabilistic dependence between alerts in a suffix-based probabilistic deterministic finite automaton (S-PDFA) -- a model that accentuates infrequent severe alerts and summarizes paths leading to them. AGs are then derived from the S-PDFA. Tested with intrusion alerts collected through the Collegiate Penetration Testing Competition, SAGE produces AGs that reflect the strategies used by participating teams. The resulting AGs are succinct, interpretable, and enable analysts to derive actionable insights, e.g., attackers tend to follow shorter paths after they have discovered a longer one.

* Accepted to appear in the 1st KDD Workshop on AI-enabled Cybersecurity Analytics (AI4cyber), 2021

EXPObench: Benchmarking Surrogate-based Optimisation Algorithms on Expensive Black-box Functions

Jun 08, 2021
Laurens Bliek, Arthur Guijt, Rickard Karlsson, Sicco Verwer, Mathijs de Weerdt

Surrogate algorithms such as Bayesian optimisation are specifically designed for black-box optimisation problems with expensive objectives, such as hyperparameter tuning or simulation-based optimisation. In the literature, these algorithms are usually evaluated with synthetic benchmarks which are well established but have no expensive objective, and only on one or two real-life applications which vary wildly between papers. There is a clear lack of standardisation when it comes to benchmarking surrogate algorithms on real-life, expensive, black-box objective functions. This makes it very difficult to draw conclusions on the effect of algorithmic contributions. A new benchmark library, EXPObench, provides first steps towards such a standardisation. The library is used to provide an extensive comparison of six different surrogate algorithms on four expensive optimisation problems from different real-life applications. This has led to new insights regarding the relative importance of exploration, the evaluation time of the objective, and the model used. A further contribution is that we make the algorithms and benchmark problem instances publicly available, contributing to more uniform analysis of surrogate algorithms. Most importantly, we include the performance of the six algorithms on all evaluated problem instances. This results in a unique new dataset that lowers the bar for researching new methods as the number of expensive evaluations required for comparison is significantly reduced.

* 13 pages

Efficient Training of Robust Decision Trees Against Adversarial Examples

Dec 18, 2020
Daniël Vos, Sicco Verwer

Today, machine learning is used for sensitive tasks that require models to be both understandable and robust. Although traditional models such as decision trees are understandable, they are vulnerable to adversarial attacks. When a decision tree is used to differentiate between a user's benign and malicious behavior, an adversarial attack allows the user to effectively evade the model by perturbing the inputs the model receives. We can use algorithms that take adversarial attacks into account to fit trees that are more robust. In this work we propose an algorithm, GROOT, that is two orders of magnitude faster than the state of the art while scoring competitively on accuracy against adversaries. GROOT accepts an intuitive and permissible threat model. Where previous threat models were limited to distance norms, we allow each feature to be perturbed with a user-specified parameter: either a maximum distance or constraints on the direction of perturbation. Previous works assumed that both benign and malicious users attempt model evasion, but we allow the user to select which classes perform adversarial attacks. Additionally, we introduce a hyperparameter rho that allows GROOT to trade off performance in the regular and adversarial settings.
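
The per-feature threat model described in this abstract can be illustrated with a small sketch. This is not GROOT's API: the `Perturbation` class, its field names, and the `can_cross_split` helper are hypothetical. The sketch assumes each feature may be moved by at most `down` in the decreasing direction and at most `up` in the increasing direction, so a direction constraint is expressed by setting one of the two to zero.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Perturbation:
    """Hypothetical per-feature attack budget: how far a value may move."""
    down: float  # maximum decrease the attacker can apply
    up: float    # maximum increase the attacker can apply


def reachable_interval(value, spec):
    """Return the closed interval of values the attacker can produce."""
    return value - spec.down, value + spec.up


def can_cross_split(value, threshold, spec):
    """True if the attacker can move `value` to the other side of a
    decision-tree split of the form `x <= threshold`."""
    low, high = reachable_interval(value, spec)
    if value <= threshold:
        return high > threshold   # sample goes left; can it be pushed right?
    return low <= threshold       # sample goes right; can it be pushed left?


# Usage: a symmetric budget versus an "increase only" constraint.
symmetric = Perturbation(down=0.1, up=0.1)
increase_only = Perturbation(down=0.0, up=0.2)
```

A robust tree learner would have to account for every sample for which such a check succeeds when it scores a candidate split; how GROOT does that efficiently is not shown here.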

Continuous surrogate-based optimization algorithms are well-suited for expensive discrete problems

Nov 06, 2020
Rickard Karlsson, Laurens Bliek, Sicco Verwer, Mathijs de Weerdt

One method to solve expensive black-box optimization problems is to use a surrogate model that approximates the objective based on previously observed evaluations. The surrogate, which is cheaper to evaluate, is optimized instead to find an approximate solution to the original problem. In the case of discrete problems, recent research has revolved around surrogate models that are specifically constructed to deal with discrete structures. A main motivation is that the literature considers continuous methods, such as Bayesian optimization with Gaussian processes as the surrogate, to be sub-optimal (especially in higher dimensions) because they ignore the discrete structure by, e.g., rounding off real-valued solutions to integers. However, we claim that this is not true. In fact, we present empirical evidence showing that the use of continuous surrogate models displays competitive performance on a set of high-dimensional discrete benchmark problems, including a real-life application, against state-of-the-art discrete surrogate-based methods. Our experiments on different discrete structures and time constraints also give more insight into which algorithms work well on which type of problem.

* 15 pages, 3 figures

Black-box Mixed-Variable Optimisation using a Surrogate Model that Satisfies Integer Constraints

Jun 08, 2020
Laurens Bliek, Sicco Verwer, Mathijs de Weerdt

A challenging problem in both engineering and computer science is that of minimising a function for which we have no mathematical formulation available, that is expensive to evaluate, and that contains continuous and integer variables, for example in automatic algorithm configuration. Surrogate modelling techniques are very suitable for this type of problem, but most existing techniques are designed with only continuous or only discrete variables in mind. Mixed-Variable ReLU-based Surrogate Modelling (MVRSM) is a surrogate modelling algorithm that uses a linear combination of rectified linear units, defined in such a way that (local) optima satisfy the integer constraints. This method is both more accurate and more efficient than the state of the art on several benchmarks with up to 238 continuous and integer variables.
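
To see why a ReLU-based surrogate can have integer-valued optima, consider a one-dimensional simplification. It is not the MVRSM construction itself. It relies only on the fact that a piecewise-linear function on an interval attains its minimum at a breakpoint or an endpoint, so if all breakpoints and both bounds are integers, some minimiser is an integer. The function names and the weights below are illustrative assumptions.

```python
def relu(z):
    """Rectified linear unit."""
    return max(0.0, z)


def make_surrogate(weights):
    """Return g(x) = sum_k c_k * relu(x - k) + d_k * relu(k - x).

    `weights` maps each integer breakpoint k to a hypothetical
    pair of coefficients (c_k, d_k)."""
    def g(x):
        return sum(c * relu(x - k) + d * relu(k - x)
                   for k, (c, d) in weights.items())
    return g


def integer_minimiser(g, lower, upper):
    """All kinks of g sit on integers, so checking the integers in
    [lower, upper] is enough to find a global minimum on that interval."""
    return min(range(lower, upper + 1), key=g)


# Usage: a non-convex example on the interval [0, 5].
g = make_surrogate({0: (1.0, 0.0), 2: (-3.0, 0.5), 4: (4.0, 0.0)})
best_x = integer_minimiser(g, 0, 5)
```

In this example the slopes of `g` are 0.5, -2 and 2 on the segments [0, 2], [2, 4] and [4, 5], so the minimum over the whole interval lies at the integer breakpoint 4.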

Black-box Combinatorial Optimization using Models with Integer-valued Minima

Nov 20, 2019
Laurens Bliek, Sicco Verwer, Mathijs de Weerdt

When a black-box optimization objective can only be evaluated with costly or noisy measurements, most standard optimization algorithms are ill-suited to finding the optimal solution. Specialized algorithms that deal with exactly this situation make use of surrogate models. These models are usually continuous and smooth, which is beneficial for continuous optimization problems, but not necessarily for combinatorial problems. However, by choosing the basis functions of the surrogate model in a certain way, we show that it can be guaranteed that the optimal solution of the surrogate model is integer. This approach outperforms random search, simulated annealing and one Bayesian optimization algorithm on the problem of finding robust routes for a noise-perturbed traveling salesman benchmark problem, with performance similar to that of another Bayesian optimization algorithm, and outperforms all compared algorithms on a convex binary optimization problem with a large number of variables.

Learning a Safety Verifiable Adaptive Cruise Controller from Human Driving Data

Oct 29, 2019
Qin Lin, Sicco Verwer, John Dolan

Imitation learning provides a way to automatically construct a controller by mimicking human behavior from data. For safety-critical systems such as autonomous vehicles, it can be problematic to use controllers learned from data because they cannot be guaranteed to be collision-free. Recently, a method has been proposed for learning a multi-mode hybrid automaton cruise controller (MOHA). Besides being accurate, the logical nature of this model makes it suitable for formal verification. In this paper, we demonstrate this capability using the SpaceEx hybrid model checker as follows. After learning, we translate the automaton model into constraints and equations required by SpaceEx. We then verify that a pure MOHA controller is not collision-free. By adding a safety state based on headway in time, a rule that human drivers should follow anyway, we do obtain a provably safe cruise control. Moreover, the safe controller remains more human-like than existing cruise controllers.

Human in the Loop: Interactive Passive Automata Learning via Evidence-Driven State-Merging Algorithms

Jul 28, 2017
Christian A. Hammerschmidt, Radu State, Sicco Verwer

We present an interactive version of an evidence-driven state-merging (EDSM) algorithm for learning variants of finite state automata. Learning these automata often amounts to recovering or reverse engineering the model generating the data despite noisy, incomplete, or imperfectly sampled data sources, rather than optimizing a purely numeric target function. Domain expertise and human knowledge about the target domain can guide this process, and are typically captured in parameter settings. Often, domain expertise is subconscious and not expressed explicitly. Directly interacting with the learning algorithm makes it easier to utilize this knowledge effectively.

* 4 pages, presented at the Human in the Loop workshop at ICML 2017

Learning Pairwise Disjoint Simple Languages from Positive Examples

Jun 06, 2017
Alexis Linard, Rick Smetsers, Frits Vaandrager, Umar Waqas, Joost van Pinxten, Sicco Verwer

A classical problem in grammatical inference is to identify a deterministic finite automaton (DFA) from a set of positive and negative examples. In this paper, we address the related - yet seemingly novel - problem of identifying a set of DFAs from examples that belong to different unknown simple regular languages. We propose two methods based on compression for clustering the observed positive examples. We apply our methods to a set of print jobs submitted to large industrial printers.

* This paper has been accepted at the Learning and Automata (LearnAut) Workshop, LICS 2017 (Reykjavik, Iceland)

Anomaly Detection in a Digital Video Broadcasting System Using Timed Automata

May 24, 2017
Xiaoran Liu, Qin Lin, Sicco Verwer, Dmitri Jarnikov

This paper focuses on detecting anomalies in a digital video broadcasting (DVB) system from the provider's perspective. We learn a probabilistic deterministic real-time automaton profiling benign behavior of encryption control in the DVB control access system. This profile is used as a one-class classifier. Anomalous items in a testing sequence are detected when the sequence is not accepted by the learned model.

* This paper has been accepted by the Thirty-Second Annual ACM/IEEE Symposium on Logic in Computer Science (LICS) Workshop on Learning and Automata (LearnAut)
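
The one-class use of an automaton described in this abstract can be sketched as follows. The automaton here is a hand-written toy with hypothetical states, symbols and time guards, not a model learned from DVB data, and the probabilistic part of the model is omitted: a sequence of (symbol, delay) events is flagged as anomalous as soon as no transition matches.

```python
# Transitions of a toy deterministic real-time automaton (all values are
# made up for illustration):
# (state, symbol) -> list of (min_delay, max_delay, next_state).
TRANSITIONS = {
    (0, "key_request"): [(0, 10, 1)],
    (1, "key_update"): [(0, 5, 0), (6, 20, 2)],
    (2, "key_request"): [(0, 10, 1)],
}


def is_accepted(sequence, transitions=TRANSITIONS, start=0):
    """Return True if every (symbol, delay) event has a matching transition."""
    state = start
    for symbol, delay in sequence:
        for low, high, target in transitions.get((state, symbol), []):
            if low <= delay <= high:
                state = target
                break
        else:
            return False  # no time guard matched: the model rejects
    return True


def is_anomalous(sequence):
    """One-class decision: anything the benign profile rejects is anomalous."""
    return not is_accepted(sequence)


# Usage: one benign-looking sequence and one with an unexpectedly long delay.
benign = [("key_request", 3), ("key_update", 4), ("key_request", 1)]
late = [("key_request", 3), ("key_update", 30)]
```

Only benign traces are needed to build such a profile, which is what makes it a one-class classifier: there is no model of anomalous behavior, just rejection by the benign model.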
