Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Brian Bell

Efficient Analysis of the Distilled Neural Tangent Kernel

Feb 11, 2026

Jamie Mahowald, Brian Bell, Alex Ho, Michael Geyer

Abstract:Neural tangent kernel (NTK) methods are computationally limited by the need to evaluate large Jacobians across many data points. Existing approaches reduce this cost primarily through projecting and sketching the Jacobian. We show that NTK computation can also be reduced by compressing the data dimension itself using NTK-tuned dataset distillation. We demonstrate that the neural tangent space spanned by the input data can be induced by dataset distillation, yielding a 20-100$\times$ reduction in required Jacobian calculations. We further show that per-class NTK matrices have low effective rank that is preserved by this reduction. Building on these insights, we propose the distilled neural tangent kernel (DNTK), which combines NTK-tuned dataset distillation with state-of-the-art projection methods to reduce up NTK computational complexity by up to five orders of magnitude while preserving kernel structure and predictive performance.

* 27 pages, 9 figures

Via

Access Paper or Ask Questions

Persistent Classification: A New Approach to Stability of Data and Adversarial Examples

Apr 11, 2024

Brian Bell, Michael Geyer, David Glickenstein, Keaton Hamm, Carlos Scheidegger, Amanda Fernandez, Juston Moore

Figure 1 for Persistent Classification: A New Approach to Stability of Data and Adversarial Examples

Figure 2 for Persistent Classification: A New Approach to Stability of Data and Adversarial Examples

Figure 3 for Persistent Classification: A New Approach to Stability of Data and Adversarial Examples

Figure 4 for Persistent Classification: A New Approach to Stability of Data and Adversarial Examples

Abstract:There are a number of hypotheses underlying the existence of adversarial examples for classification problems. These include the high-dimensionality of the data, high codimension in the ambient space of the data manifolds of interest, and that the structure of machine learning models may encourage classifiers to develop decision boundaries close to data points. This article proposes a new framework for studying adversarial examples that does not depend directly on the distance to the decision boundary. Similarly to the smoothed classifier literature, we define a (natural or adversarial) data point to be $(\gamma,\sigma)$-stable if the probability of the same classification is at least $\gamma$ for points sampled in a Gaussian neighborhood of the point with a given standard deviation $\sigma$. We focus on studying the differences between persistence metrics along interpolants of natural and adversarial points. We show that adversarial examples have significantly lower persistence than natural examples for large neural networks in the context of the MNIST and ImageNet datasets. We connect this lack of persistence with decision boundary geometry by measuring angles of interpolants with respect to decision boundaries. Finally, we connect this approach with robustness by developing a manifold alignment gradient metric and demonstrating the increase in robustness that can be achieved when training with the addition of this metric.

Via

Access Paper or Ask Questions

An Exact Kernel Equivalence for Finite Classification Models

Aug 09, 2023

Brian Bell, Michael Geyer, David Glickenstein, Amanda Fernandez, Juston Moore

Figure 1 for An Exact Kernel Equivalence for Finite Classification Models

Figure 2 for An Exact Kernel Equivalence for Finite Classification Models

Figure 3 for An Exact Kernel Equivalence for Finite Classification Models

Figure 4 for An Exact Kernel Equivalence for Finite Classification Models

Abstract:We explore the equivalence between neural networks and kernel methods by deriving the first exact representation of any finite-size parametric classification model trained with gradient descent as a kernel machine. We compare our exact representation to the well-known Neural Tangent Kernel (NTK) and discuss approximation error relative to the NTK and other non-exact path kernel formulations. We experimentally demonstrate that the kernel can be computed for realistic networks up to machine precision. We use this exact kernel to show that our theoretical contribution can provide useful insights into the predictions made by neural networks, particularly the way in which they generalize.

* TAG-ML at ICML 2023 in Proceedings. 8 pages, 6 figures, proofs in Appendix

Via

Access Paper or Ask Questions