Title: An analytic theory of shallow networks dynamics for hinge loss classification

Jun 19, 2020

Franco Pellegrini, Giulio Biroli

Abstract: Neural networks have been shown to perform incredibly well in classification tasks over structured high-dimensional datasets. However, the learning dynamics of such networks is still poorly understood. In this paper we study in detail the training dynamics of a simple type of neural network: a single hidden layer trained to perform a classification task. We show that in a suitable mean-field limit this case maps to a single-node learning problem with a time-dependent dataset determined self-consistently from the average node population. We specialize our theory to the prototypical case of a linearly separable dataset and a linear hinge loss, for which the dynamics can be explicitly solved. This allows us to address in a simple setting several phenomena appearing in modern networks, such as slowing down of training dynamics, crossover between rich and lazy learning, and overfitting. Finally, we assess the limitations of mean-field theory by studying the case of a large but finite number of nodes and of training samples.

* 16 pages, 6 figures
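
To make the setting in the abstract concrete, here is a minimal numerical sketch of a single-hidden-layer network trained by full-batch gradient descent with a hinge loss on a linearly separable dataset. It is an illustration only, not the authors' code. Everything specific in it is an assumption not given in the abstract: the ReLU activation, the 1/N output scaling with the learning rate rescaled by N (one common mean-field convention), the Gaussian toy data with a gap along the first coordinate, and all names and hyperparameters.

```python
import numpy as np


def make_separable_data(n_samples, dim, rng, gap=0.5):
    """Toy linearly separable data (assumed): label = sign of first coordinate."""
    x = rng.standard_normal((n_samples, dim))
    y = np.where(x[:, 0] >= 0, 1.0, -1.0)
    x[:, 0] += gap * y  # push the two classes apart along the first axis
    return x, y


def forward(a, w, x):
    """Assumed mean-field scaling: f(x) = (1/N) * sum_i a_i * relu(w_i . x)."""
    pre = x @ w.T  # shape (n_samples, n_hidden)
    return (np.maximum(pre, 0.0) @ a) / a.shape[0]


def train(x, y, n_hidden=100, lr=0.2, steps=1000, margin=1.0, seed=0):
    """Full-batch gradient descent on the hinge loss mean(max(0, margin - y * f))."""
    rng = np.random.default_rng(seed)
    n, d = x.shape
    w = rng.standard_normal((n_hidden, d)) / np.sqrt(d)
    a = rng.standard_normal(n_hidden)
    losses = []
    for _ in range(steps):
        pre = x @ w.T
        act = np.maximum(pre, 0.0)
        f = act @ a / n_hidden
        slack = margin - y * f
        losses.append(float(np.mean(np.maximum(0.0, slack))))
        # Only samples that violate the margin contribute to the gradient.
        g = -(y * (slack > 0)) / n  # dL/df for each sample
        # Gradients multiplied by n_hidden, so each node moves at an O(1) rate.
        grad_a = act.T @ g
        grad_w = (g[:, None] * (pre > 0) * a[None, :]).T @ x
        a -= lr * grad_a
        w -= lr * grad_w
    return a, w, losses


if __name__ == "__main__":
    rng = np.random.default_rng(1)
    x, y = make_separable_data(200, 5, rng)
    a, w, losses = train(x, y)
    accuracy = np.mean(np.sign(forward(a, w, x)) == y)
    print("initial loss:", losses[0])
    print("final loss:", losses[-1])
    print("training accuracy:", accuracy)
```

Because only margin-violating samples enter the gradient, the set of samples that drives each node changes during training. This gives a rough feel for the "time-dependent dataset" picture mentioned in the abstract, although the sketch makes no attempt to reproduce the paper's analytic solution.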
