Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jonathan Root

Learning Minimum Volume Sets and Anomaly Detectors from KNN Graphs

Jan 22, 2016

Jonathan Root, Venkatesh Saligrama, Jing Qian

Figure 1 for Learning Minimum Volume Sets and Anomaly Detectors from KNN Graphs

Figure 2 for Learning Minimum Volume Sets and Anomaly Detectors from KNN Graphs

Figure 3 for Learning Minimum Volume Sets and Anomaly Detectors from KNN Graphs

Figure 4 for Learning Minimum Volume Sets and Anomaly Detectors from KNN Graphs

Abstract:We propose a non-parametric anomaly detection algorithm for high dimensional data. We first rank scores derived from nearest neighbor graphs on $n$-point nominal training data. We then train limited complexity models to imitate these scores based on the max-margin learning-to-rank framework. A test-point is declared as an anomaly at $\alpha$-false alarm level if the predicted score is in the $\alpha$-percentile. The resulting anomaly detector is shown to be asymptotically optimal in that for any false alarm rate $\alpha$, its decision region converges to the $\alpha$-percentile minimum volume level set of the unknown underlying density. In addition, we test both the statistical performance and computational efficiency of our algorithm on a number of synthetic and real-data experiments. Our results demonstrate the superiority of our algorithm over existing $K$-NN based anomaly detection algorithms, with significant computational savings.

* arXiv admin note: substantial text overlap with arXiv:1502.01783, arXiv:1405.0530

Via

Access Paper or Ask Questions

Learning Efficient Anomaly Detectors from $K$-NN Graphs

Feb 06, 2015

Jing Qian, Jonathan Root, Venkatesh Saligrama

Figure 1 for Learning Efficient Anomaly Detectors from $K$-NN Graphs

Figure 2 for Learning Efficient Anomaly Detectors from $K$-NN Graphs

Figure 3 for Learning Efficient Anomaly Detectors from $K$-NN Graphs

Figure 4 for Learning Efficient Anomaly Detectors from $K$-NN Graphs

Abstract:We propose a non-parametric anomaly detection algorithm for high dimensional data. We score each datapoint by its average $K$-NN distance, and rank them accordingly. We then train limited complexity models to imitate these scores based on the max-margin learning-to-rank framework. A test-point is declared as an anomaly at $\alpha$-false alarm level if the predicted score is in the $\alpha$-percentile. The resulting anomaly detector is shown to be asymptotically optimal in that for any false alarm rate $\alpha$, its decision region converges to the $\alpha$-percentile minimum volume level set of the unknown underlying density. In addition, we test both the statistical performance and computational efficiency of our algorithm on a number of synthetic and real-data experiments. Our results demonstrate the superiority of our algorithm over existing $K$-NN based anomaly detection algorithms, with significant computational savings.

* arXiv admin note: text overlap with arXiv:1405.0530

Via

Access Paper or Ask Questions

A Rank-SVM Approach to Anomaly Detection

May 02, 2014

Jing Qian, Jonathan Root, Venkatesh Saligrama, Yuting Chen

Figure 1 for A Rank-SVM Approach to Anomaly Detection

Figure 2 for A Rank-SVM Approach to Anomaly Detection

Figure 3 for A Rank-SVM Approach to Anomaly Detection

Figure 4 for A Rank-SVM Approach to Anomaly Detection

Abstract:We propose a novel non-parametric adaptive anomaly detection algorithm for high dimensional data based on rank-SVM. Data points are first ranked based on scores derived from nearest neighbor graphs on n-point nominal data. We then train a rank-SVM using this ranked data. A test-point is declared as an anomaly at alpha-false alarm level if the predicted score is in the alpha-percentile. The resulting anomaly detector is shown to be asymptotically optimal and adaptive in that for any false alarm rate alpha, its decision region converges to the alpha-percentile level set of the unknown underlying density. In addition we illustrate through a number of synthetic and real-data experiments both the statistical performance and computational efficiency of our anomaly detector.

Via

Access Paper or Ask Questions