Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Maria L. Weese

The use of cross validation in the analysis of designed experiments

Jun 17, 2025

Maria L. Weese, Byran J. Smucker, David J. Edwards

Abstract:Cross-validation (CV) is a common method to tune machine learning methods and can be used for model selection in regression as well. Because of the structured nature of small, traditional experimental designs, the literature has warned against using CV in their analysis. The striking increase in the use of machine learning, and thus CV, in the analysis of experimental designs, has led us to empirically study the effectiveness of CV compared to other methods of selecting models in designed experiments, including the little bootstrap. We consider both response surface settings where prediction is of primary interest, as well as screening where factor selection is most important. Overall, we provide evidence that the use of leave-one-out cross-validation (LOOCV) in the analysis of small, structured is often useful. More general $k$-fold CV may also be competitive but its performance is uneven.

Via

Access Paper or Ask Questions

Boundary Peeling: Outlier Detection Method Using One-Class Peeling

Sep 11, 2023

Sheikh Arafat, Na Sun, Maria L. Weese, Waldyn G. Martinez

Figure 1 for Boundary Peeling: Outlier Detection Method Using One-Class Peeling

Figure 2 for Boundary Peeling: Outlier Detection Method Using One-Class Peeling

Figure 3 for Boundary Peeling: Outlier Detection Method Using One-Class Peeling

Figure 4 for Boundary Peeling: Outlier Detection Method Using One-Class Peeling

Abstract:Unsupervised outlier detection constitutes a crucial phase within data analysis and remains a dynamic realm of research. A good outlier detection algorithm should be computationally efficient, robust to tuning parameter selection, and perform consistently well across diverse underlying data distributions. We introduce One-Class Boundary Peeling, an unsupervised outlier detection algorithm. One-class Boundary Peeling uses the average signed distance from iteratively-peeled, flexible boundaries generated by one-class support vector machines. One-class Boundary Peeling has robust hyperparameter settings and, for increased flexibility, can be cast as an ensemble method. In synthetic data simulations One-Class Boundary Peeling outperforms all state of the art methods when no outliers are present while maintaining comparable or superior performance in the presence of outliers, as compared to benchmark methods. One-Class Boundary Peeling performs competitively in terms of correct classification, AUC, and processing time using common benchmark data sets.

Via

Access Paper or Ask Questions