Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Amol Khanna

Adversarial Samples Are Not Created Equal

Jan 02, 2026

Jennifer Crawford, Amol Khanna, Fred Lu, Amy R. Wagoner, Stella Biderman, Andre T. Nguyen, Edward Raff

Abstract:Over the past decade, numerous theories have been proposed to explain the widespread vulnerability of deep neural networks to adversarial evasion attacks. Among these, the theory of non-robust features proposed by Ilyas et al. has been widely accepted, showing that brittle but predictive features of the data distribution can be directly exploited by attackers. However, this theory overlooks adversarial samples that do not directly utilize these features. In this work, we advocate that these two kinds of samples - those which use use brittle but predictive features and those that do not - comprise two types of adversarial weaknesses and should be differentiated when evaluating adversarial robustness. For this purpose, we propose an ensemble-based metric to measure the manipulation of non-robust features by adversarial perturbations and use this metric to analyze the makeup of adversarial samples generated by attackers. This new perspective also allows us to re-examine multiple phenomena, including the impact of sharpness-aware minimization on adversarial robustness and the robustness gap observed between adversarially training and standard training on robust datasets.

Via

Access Paper or Ask Questions

Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent

Mar 25, 2025

Philip Doldo, Derek Everett, Amol Khanna, Andre T Nguyen, Edward Raff

Figure 1 for Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent

Figure 2 for Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent

Figure 3 for Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent

Figure 4 for Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent

Abstract:Projected Gradient Descent (PGD) under the $L_\infty$ ball has become one of the defacto methods used in adversarial robustness evaluation for computer vision (CV) due to its reliability and efficacy, making a strong and easy-to-implement iterative baseline. However, PGD is computationally demanding to apply, especially when using thousands of iterations is the current best-practice recommendation to generate an adversarial example for a single image. In this work, we introduce a simple novel method for early termination of PGD based on cycle detection by exploiting the geometry of how PGD is implemented in practice and show that it can produce large speedup factors while providing the \emph{exact} same estimate of model robustness as standard PGD. This method substantially speeds up PGD without sacrificing any attack strength, enabling evaluations of robustness that were previously computationally intractable.

* To appear in the 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Via

Access Paper or Ask Questions

Differentially Private Iterative Screening Rules for Linear Regression

Feb 25, 2025

Amol Khanna, Fred Lu, Edward Raff

Figure 1 for Differentially Private Iterative Screening Rules for Linear Regression

Figure 2 for Differentially Private Iterative Screening Rules for Linear Regression

Figure 3 for Differentially Private Iterative Screening Rules for Linear Regression

Figure 4 for Differentially Private Iterative Screening Rules for Linear Regression

Abstract:Linear $L_1$-regularized models have remained one of the simplest and most effective tools in data science. Over the past decade, screening rules have risen in popularity as a way to eliminate features when producing the sparse regression weights of $L_1$ models. However, despite the increasing need of privacy-preserving models for data analysis, to the best of our knowledge, no differentially private screening rule exists. In this paper, we develop the first private screening rule for linear regression. We initially find that this screening rule is too strong: it screens too many coefficients as a result of the private screening step. However, a weakened implementation of private screening reduces overscreening and improves performance.

* Proceedings of the 15th ACM Conference on Data and Application Security and Privacy

Via

Access Paper or Ask Questions

Multi-layer Radial Basis Function Networks for Out-of-distribution Detection

Jan 05, 2025

Amol Khanna, Chenyi Ling, Derek Everett, Edward Raff, Nathan Inkawhich

Figure 1 for Multi-layer Radial Basis Function Networks for Out-of-distribution Detection

Figure 2 for Multi-layer Radial Basis Function Networks for Out-of-distribution Detection

Figure 3 for Multi-layer Radial Basis Function Networks for Out-of-distribution Detection

Figure 4 for Multi-layer Radial Basis Function Networks for Out-of-distribution Detection

Abstract:Existing methods for out-of-distribution (OOD) detection use various techniques to produce a score, separate from classification, that determines how ``OOD'' an input is. Our insight is that OOD detection can be simplified by using a neural network architecture which can effectively merge classification and OOD detection into a single step. Radial basis function networks (RBFNs) inherently link classification confidence and OOD detection; however, these networks have lost popularity due to the difficult of training them in a multi-layer fashion. In this work, we develop a multi-layer radial basis function network (MLRBFN) which can be easily trained. To ensure that these networks are also effective for OOD detection, we develop a novel depression mechanism. We apply MLRBFNs as standalone classifiers and as heads on top of pretrained feature extractors, and find that they are competitive with commonly used methods for OOD detection. Our MLRBFN architecture demonstrates a promising new direction for OOD detection methods.

Via

Access Paper or Ask Questions

Position: Challenges and Opportunities for Differential Privacy in the U.S. Federal Government

Oct 21, 2024

Amol Khanna, Adam McCormick, Andre Nguyen, Chris Aguirre, Edward Raff

Abstract:In this article, we seek to elucidate challenges and opportunities for differential privacy within the federal government setting, as seen by a team of differential privacy researchers, privacy lawyers, and data scientists working closely with the U.S. government. After introducing differential privacy, we highlight three significant challenges which currently restrict the use of differential privacy in the U.S. government. We then provide two examples where differential privacy can enhance the capabilities of government agencies. The first example highlights how the quantitative nature of differential privacy allows policy security officers to release multiple versions of analyses with different levels of privacy. The second example, which we believe is a novel realization, indicates that differential privacy can be used to improve staffing efficiency in classified applications. We hope that this article can serve as a nontechnical resource which can help frame future action from the differential privacy community, privacy regulators, security officers, and lawmakers.

* 2nd Workshop on Regulatable ML at NeurIPS 2024

Via

Access Paper or Ask Questions

Feature Selection from Differentially Private Correlations

Aug 20, 2024

Ryan Swope, Amol Khanna, Philip Doldo, Saptarshi Roy, Edward Raff

Figure 1 for Feature Selection from Differentially Private Correlations

Figure 2 for Feature Selection from Differentially Private Correlations

Figure 3 for Feature Selection from Differentially Private Correlations

Figure 4 for Feature Selection from Differentially Private Correlations

Abstract:Data scientists often seek to identify the most important features in high-dimensional datasets. This can be done through $L_1$-regularized regression, but this can become inefficient for very high-dimensional datasets. Additionally, high-dimensional regression can leak information about individual datapoints in a dataset. In this paper, we empirically evaluate the established baseline method for feature selection with differential privacy, the two-stage selection technique, and show that it is not stable under sparsity. This makes it perform poorly on real-world datasets, so we consider a different approach to private feature selection. We employ a correlations-based order statistic to choose important features from a dataset and privatize them to ensure that the results do not leak information about individual datapoints. We find that our method significantly outperforms the established baseline for private feature selection on many datasets.

* To appear in Proceedings of the 17th ACM Workshop on Artificial Intelligence and Security, 2024

Via

Access Paper or Ask Questions

SoK: A Review of Differentially Private Linear Models For High-Dimensional Data

Apr 01, 2024

Amol Khanna, Edward Raff, Nathan Inkawhich

Figure 1 for SoK: A Review of Differentially Private Linear Models For High-Dimensional Data

Figure 2 for SoK: A Review of Differentially Private Linear Models For High-Dimensional Data

Figure 3 for SoK: A Review of Differentially Private Linear Models For High-Dimensional Data

Figure 4 for SoK: A Review of Differentially Private Linear Models For High-Dimensional Data

Abstract:Linear models are ubiquitous in data science, but are particularly prone to overfitting and data memorization in high dimensions. To guarantee the privacy of training data, differential privacy can be used. Many papers have proposed optimization techniques for high-dimensional differentially private linear models, but a systematic comparison between these methods does not exist. We close this gap by providing a comprehensive review of optimization methods for private high-dimensional linear models. Empirical tests on all methods demonstrate robust and coordinate-optimized algorithms perform best, which can inform future research. Code for implementing all methods is released online.

* 21 pages, 7 figures. To be published at the 2nd IEEE Conference on Secure and Trustworthy Machine Learning (SaTML)

Via

Access Paper or Ask Questions

Comprehensive OOD Detection Improvements

Jan 18, 2024

Anish Lakkapragada, Amol Khanna, Edward Raff, Nathan Inkawhich

Abstract:As machine learning becomes increasingly prevalent in impactful decisions, recognizing when inference data is outside the model's expected input distribution is paramount for giving context to predictions. Out-of-distribution (OOD) detection methods have been created for this task. Such methods can be split into representation-based or logit-based methods from whether they respectively utilize the model's embeddings or predictions for OOD detection. In contrast to most papers which solely focus on one such group, we address both. We employ dimensionality reduction on feature embeddings in representation-based methods for both time speedups and improved performance. Additionally, we propose DICE-COL, a modification of the popular logit-based method Directed Sparsification (DICE) that resolves an unnoticed flaw. We demonstrate the effectiveness of our methods on the OpenOODv1.5 benchmark framework, where they significantly improve performance and set state-of-the-art results.

Via

Access Paper or Ask Questions

Scaling Up Differentially Private LASSO Regularized Logistic Regression via Faster Frank-Wolfe Iterations

Oct 30, 2023

Edward Raff, Amol Khanna, Fred Lu

Figure 1 for Scaling Up Differentially Private LASSO Regularized Logistic Regression via Faster Frank-Wolfe Iterations

Figure 2 for Scaling Up Differentially Private LASSO Regularized Logistic Regression via Faster Frank-Wolfe Iterations

Figure 3 for Scaling Up Differentially Private LASSO Regularized Logistic Regression via Faster Frank-Wolfe Iterations

Figure 4 for Scaling Up Differentially Private LASSO Regularized Logistic Regression via Faster Frank-Wolfe Iterations

Abstract:To the best of our knowledge, there are no methods today for training differentially private regression models on sparse input data. To remedy this, we adapt the Frank-Wolfe algorithm for $L_1$ penalized linear regression to be aware of sparse inputs and to use them effectively. In doing so, we reduce the training time of the algorithm from $\mathcal{O}( T D S + T N S)$ to $\mathcal{O}(N S + T \sqrt{D} \log{D} + T S^2)$, where $T$ is the number of iterations and a sparsity rate $S$ of a dataset with $N$ rows and $D$ features. Our results demonstrate that this procedure can reduce runtime by a factor of up to $2,200\times$, depending on the value of the privacy parameter $\epsilon$ and the sparsity of the dataset.

* To appear in the 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

Via

Access Paper or Ask Questions

Sparse Private LASSO Logistic Regression

Apr 29, 2023

Amol Khanna, Fred Lu, Edward Raff, Brian Testa

Figure 1 for Sparse Private LASSO Logistic Regression

Figure 2 for Sparse Private LASSO Logistic Regression

Figure 3 for Sparse Private LASSO Logistic Regression

Figure 4 for Sparse Private LASSO Logistic Regression

Abstract:LASSO regularized logistic regression is particularly useful for its built-in feature selection, allowing coefficients to be removed from deployment and producing sparse solutions. Differentially private versions of LASSO logistic regression have been developed, but generally produce dense solutions, reducing the intrinsic utility of the LASSO penalty. In this paper, we present a differentially private method for sparse logistic regression that maintains hard zeros. Our key insight is to first train a non-private LASSO logistic regression model to determine an appropriate privatized number of non-zero coefficients to use in final model selection. To demonstrate our method's performance, we run experiments on synthetic and real-world datasets.

* 20 pages, 5 figures

Via

Access Paper or Ask Questions