Polylogarithmic width suffices for gradient descent to achieve arbitrarily small test error with shallow ReLU networks

Sep 29, 2019

View paper on arXiv or OpenReview
