Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Ekkehard Schnoor

Incorporating priors in learning: a random matrix study under a teacher-student framework

Sep 26, 2025

Malik Tiomoko, Ekkehard Schnoor

Abstract:Regularized linear regression is central to machine learning, yet its high-dimensional behavior with informative priors remains poorly understood. We provide the first exact asymptotic characterization of training and test risks for maximum a posteriori (MAP) regression with Gaussian priors centered at a domain-informed initialization. Our framework unifies ridge regression, least squares, and prior-informed estimators, and -- using random matrix theory -- yields closed-form risk formulas that expose the bias-variance-prior tradeoff, explain double descent, and quantify prior mismatch. We also identify a closed-form minimizer of test risk, enabling a simple estimator of the optimal regularization parameter. Simulations confirm the theory with high accuracy. By connecting Bayesian priors, classical regularization, and modern asymptotics, our results provide both conceptual clarity and practical guidance for learning with structured prior knowledge.

* 5 pages, 4 figures

Via

Access Paper or Ask Questions

High-Dimensional Analysis of Bootstrap Ensemble Classifiers

May 20, 2025

Hamza Cherkaoui, Malik Tiomoko, Mohamed El Amine Seddik, Cosme Louart, Ekkehard Schnoor, Balazs Kegl

Abstract:Bootstrap methods have long been a cornerstone of ensemble learning in machine learning. This paper presents a theoretical analysis of bootstrap techniques applied to the Least Square Support Vector Machine (LSSVM) ensemble in the context of large and growing sample sizes and feature dimensionalities. Leveraging tools from Random Matrix Theory, we investigate the performance of this classifier that aggregates decision functions from multiple weak classifiers, each trained on different subsets of the data. We provide insights into the use of bootstrap methods in high-dimensional settings, enhancing our understanding of their impact. Based on these findings, we propose strategies to select the number of subsets and the regularization parameter that maximize the performance of the LSSVM. Empirical experiments on synthetic and real-world datasets validate our theoretical results.

Via

Access Paper or Ask Questions

Generalization Error Bounds for Iterative Recovery Algorithms Unfolded as Neural Networks

Dec 08, 2021

Ekkehard Schnoor, Arash Behboodi, Holger Rauhut

Figure 1 for Generalization Error Bounds for Iterative Recovery Algorithms Unfolded as Neural Networks

Figure 2 for Generalization Error Bounds for Iterative Recovery Algorithms Unfolded as Neural Networks

Figure 3 for Generalization Error Bounds for Iterative Recovery Algorithms Unfolded as Neural Networks

Figure 4 for Generalization Error Bounds for Iterative Recovery Algorithms Unfolded as Neural Networks

Abstract:Motivated by the learned iterative soft thresholding algorithm (LISTA), we introduce a general class of neural networks suitable for sparse reconstruction from few linear measurements. By allowing a wide range of degrees of weight-sharing between the layers, we enable a unified analysis for very different neural network types, ranging from recurrent ones to networks more similar to standard feedforward neural networks. Based on training samples, via empirical risk minimization we aim at learning the optimal network parameters and thereby the optimal network that reconstructs signals from their low-dimensional linear measurements. We derive generalization bounds by analyzing the Rademacher complexity of hypothesis classes consisting of such deep networks, that also take into account the thresholding parameters. We obtain estimates of the sample complexity that essentially depend only linearly on the number of parameters and on the depth. We apply our main result to obtain specific generalization bounds for several practical examples, including different algorithms for (implicit) dictionary learning, and convolutional neural networks.

* 29 pages, 6 figures

Via

Access Paper or Ask Questions

Generalization bounds for deep thresholding networks

Oct 29, 2020

Arash Behboodi, Holger Rauhut, Ekkehard Schnoor

Figure 1 for Generalization bounds for deep thresholding networks

Figure 2 for Generalization bounds for deep thresholding networks

Figure 3 for Generalization bounds for deep thresholding networks

Abstract:We consider compressive sensing in the scenario where the sparsity basis (dictionary) is not known in advance, but needs to be learned from examples. Motivated by the well-known iterative soft thresholding algorithm for the reconstruction, we define deep networks parametrized by the dictionary, which we call deep thresholding networks. Based on training samples, we aim at learning the optimal sparsifying dictionary and thereby the optimal network that reconstructs signals from their low-dimensional linear measurements. The dictionary learning is performed via minimizing the empirical risk. We derive generalization bounds by analyzing the Rademacher complexity of hypothesis classes consisting of such deep networks. We obtain estimates of the sample complexity that depend only linearly on the dimensions and on the depth.

* 19 pages, 4 figures

Via

Access Paper or Ask Questions