Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Exit Time Analysis for Approximations of Gradient Descent Trajectories Around Saddle Points

Jun 01, 2020

Rishabh Dixit, Waheed U. Bajwa

Figure 1 for Exit Time Analysis for Approximations of Gradient Descent Trajectories Around Saddle Points

Figure 2 for Exit Time Analysis for Approximations of Gradient Descent Trajectories Around Saddle Points

Figure 3 for Exit Time Analysis for Approximations of Gradient Descent Trajectories Around Saddle Points

Figure 4 for Exit Time Analysis for Approximations of Gradient Descent Trajectories Around Saddle Points

Share this with someone who'll enjoy it:

Abstract:This paper considers the problem of understanding the exit time for trajectories of gradient-related first-order methods from saddle neighborhoods under some initial boundary conditions. Given the `flat' geometry around saddle points, first-order methods can struggle in escaping these regions in a fast manner due to the small magnitudes of gradients encountered. In particular, while it is known that gradient-related first-order methods escape strict-saddle neighborhoods, existing literature does not explicitly leverage the local geometry around saddle points in order to control behavior of gradient trajectories. It is in this context that this paper puts forth a rigorous geometric analysis of the gradient-descent method around strict-saddle neighborhoods using matrix perturbation theory. In doing so, it provides a key result that can be used to generate an approximate gradient trajectory for any given initial conditions. In addition, the analysis leads to a linear exit-time solution for gradient-descent method under certain necessary initial conditions for a class of strict-saddle functions.

* 32 pages; preprint of a paper under review

View paper on

Share this with someone who'll enjoy it:

Title:Exit Time Analysis for Approximations of Gradient Descent Trajectories Around Saddle Points

Paper and Code