Small nonlinearities in activation functions create bad local minima in neural networks

Add code
Sep 28, 2018

Share this with someone who'll enjoy it:

View paper onarxiv iconopen_review iconOpenReview

Share this with someone who'll enjoy it: