Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Manzoor A. Khan

Detecting Concept Drift in Neural Networks Using Chi-squared Goodness of Fit Testing

May 07, 2025

Jacob Glenn Ayers, Buvaneswari A. Ramanan, Manzoor A. Khan

Figure 1 for Detecting Concept Drift in Neural Networks Using Chi-squared Goodness of Fit Testing

Figure 2 for Detecting Concept Drift in Neural Networks Using Chi-squared Goodness of Fit Testing

Figure 3 for Detecting Concept Drift in Neural Networks Using Chi-squared Goodness of Fit Testing

Figure 4 for Detecting Concept Drift in Neural Networks Using Chi-squared Goodness of Fit Testing

Abstract:As the adoption of deep learning models has grown beyond human capacity for verification, meta-algorithms are needed to ensure reliable model inference. Concept drift detection is a field dedicated to identifying statistical shifts that is underutilized in monitoring neural networks that may encounter inference data with distributional characteristics diverging from their training data. Given the wide variety of model architectures, applications, and datasets, it is important that concept drift detection algorithms are adaptable to different inference scenarios. In this paper, we introduce an application of the $\chi^2$ Goodness of Fit Hypothesis Test as a drift detection meta-algorithm applied to a multilayer perceptron, a convolutional neural network, and a transformer trained for machine vision as they are exposed to simulated drift during inference. To that end, we demonstrate how unexpected drops in accuracy due to concept drift can be detected without directly examining the inference outputs. Our approach enhances safety by ensuring models are continually evaluated for reliability across varying conditions.

* 8 pages, 6 figures, 1 table

Via

Access Paper or Ask Questions

Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs

May 10, 2024

Harsh Patel, Buvaneswari A. Ramanan, Manzoor A. Khan, Thomas Williams, Brian Friedman, Lawrence Drabeck

Figure 1 for Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs

Figure 2 for Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs

Figure 3 for Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs

Figure 4 for Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs

Abstract:This paper explores the possibilities of the current generation of Large Language Models for incorporating Machine Learning Operations (MLOps) functionalities into ML training code bases. We evaluate the performance of OpenAI (gpt-3.5-turbo) and WizardCoder (open-source, 15B parameters) models on the automated accomplishment of various MLOps functionalities in different settings. We perform a benchmarking study that assesses the ability of these models to: (1) adapt existing code samples (Inlining) with component-specific MLOps functionality such as MLflow and Weights & Biases for experiment tracking, Optuna for hyperparameter optimization etc., and (2) perform the task of Translation from one component of an MLOps functionality to another, e.g., translating existing GitPython library based version control code to Data Version Control library based. We also propose three different approaches that involve teaching LLMs to comprehend the API documentation of the components as a reference while accomplishing the Translation tasks. In our evaluations, the gpt-3.5-turbo model significantly outperforms WizardCoder by achieving impressive Pass@3 accuracy in model optimization (55% compared to 0% by WizardCoder), experiment tracking (100%, compared to 62.5% by WizardCoder), model registration (92% compared to 42% by WizardCoder) and hyperparameter optimization (83% compared to 58% by WizardCoder) on average, in their best possible settings, showcasing its superior code adaptability performance in complex MLOps tasks.

* The work was completed during 2Q, 3Q of Year 2023, when WizardCoder was the top performing Open source LLM for coding. Newer and better models have emerged since then. The processes and methodologies utilized for this benchmarking can still be utilized for evaluating the current SoTA models

Via

Access Paper or Ask Questions