Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jocelyn T. Chi

Sketched Gaussian Model Linear Discriminant Analysis via the Randomized Kaczmarz Method

Nov 10, 2022

Jocelyn T. Chi, Deanna Needell

Figure 1 for Sketched Gaussian Model Linear Discriminant Analysis via the Randomized Kaczmarz Method

Figure 2 for Sketched Gaussian Model Linear Discriminant Analysis via the Randomized Kaczmarz Method

Figure 3 for Sketched Gaussian Model Linear Discriminant Analysis via the Randomized Kaczmarz Method

Figure 4 for Sketched Gaussian Model Linear Discriminant Analysis via the Randomized Kaczmarz Method

Abstract:We present sketched linear discriminant analysis, an iterative randomized approach to binary-class Gaussian model linear discriminant analysis (LDA) for very large data. We harness a least squares formulation and mobilize the stochastic gradient descent framework. Therefore, we obtain a randomized classifier with performance that is very comparable to that of full data LDA while requiring access to only one row of the training data at a time. We present convergence guarantees for the sketched predictions on new data within a fixed number of iterations. These guarantees account for both the Gaussian modeling assumptions on the data and algorithmic randomness from the sketching procedure. Finally, we demonstrate performance with varying step-sizes and numbers of iterations. Our numerical experiments demonstrate that sketched LDA can offer a very viable alternative to full data LDA when the data may be too large for full data analysis.

Via

Access Paper or Ask Questions

Randomized Least Squares Regression: Combining Model- and Algorithm-Induced Uncertainties

Aug 17, 2018

Jocelyn T. Chi, Ilse C. F. Ipsen

Abstract:We analyze the uncertainties in the minimum norm solution of full-rank regression problems, arising from Gaussian linear models, computed by randomized (row-wise sampling and, more generally, sketching) algorithms. From a deterministic perspective, our structural perturbation bounds imply that least squares problems are less sensitive to multiplicative perturbations than to additive perturbations. From a probabilistic perspective, our expressions for the total expectation and variance with regard to both model- and algorithm-induced uncertainties, are exact, hold for general sketching matrices, and make no assumptions on the rank of the sketched matrix. The relative differences between the total bias and variance on the one hand, and the model bias and variance on the other hand, are governed by two factors: (i) the expected rank deficiency of the sketched matrix, and (ii) the expected difference between projectors associated with the original and the sketched problems. A simple example, based on uniform sampling with replacement, illustrates the statistical quantities.

Via

Access Paper or Ask Questions