Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Measuring and regularizing networks in function space

May 21, 2018

Ari S. Benjamin, David Rolnick, Konrad Kording

Figure 1 for Measuring and regularizing networks in function space

Figure 2 for Measuring and regularizing networks in function space

Figure 3 for Measuring and regularizing networks in function space

Figure 4 for Measuring and regularizing networks in function space

Share this with someone who'll enjoy it:

Abstract:Neural network optimization is often conceptualized as optimizing parameters, but it is ultimately a matter of optimizing a function defined by inputs and outputs. However, little work has empirically evaluated network optimization in the space of possible functions and much analysis relies on Lipschitz bounds. Here, we measure the behavior of several networks in an $L^2$ Hilbert space. Lipschitz bounds appear reasonable in late optimization but not the beginning. We also observe that the function continues to change even after test error saturates. In light of this we propose a learning rule, Hilbert-constrained gradient descent (HCGD), that regularizes the distance a network can travel through $L^2$-space in any one update. HCGD should increase generalization if it is important that single updates minimally change the output function. Experiments show that HCGD reduces exploration in function space and often, but not always, improves generalization. We connect this idea to the natural gradient, which can also be derived from penalizing changes in the outputs. We conclude that decreased movement in function space is an important consideration in training neural networks.

* Submitted to to NIPS 2018

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Measuring and regularizing networks in function space

Paper and Code