Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Memorizing Gaussians with no over-parameterizaion via gradient decent on neural networks

Mar 28, 2020

Amit Daniely

Share this with someone who'll enjoy it:

Abstract:We prove that a single step of gradient decent over depth two network, with $q$ hidden neurons, starting from orthogonal initialization, can memorize $\Omega\left(\frac{dq}{\log^4(d)}\right)$ independent and randomly labeled Gaussians in $\mathbb{R}^d$. The result is valid for a large class of activation functions, which includes the absolute value.

View paper on

Share this with someone who'll enjoy it:

Title:Memorizing Gaussians with no over-parameterizaion via gradient decent on neural networks

Paper and Code