Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction

Oct 02, 2020

Wei Deng, Qi Feng, Georgios Karagiannis, Guang Lin, Faming Liang

Figure 1 for Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction

Figure 2 for Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction

Figure 3 for Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction

Figure 4 for Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction

Share this with someone who'll enjoy it:

Abstract:Replica exchange stochastic gradient Langevin dynamics (reSGLD) has shown promise in accelerating the convergence in non-convex learning; however, an excessively large correction for avoiding biases from noisy energy estimators has limited the potential of the acceleration. To address this issue, we study the variance reduction for noisy energy estimators, which promotes much more effective swaps. Theoretically, we provide a non-asymptotic analysis on the exponential acceleration for the underlying continuous-time Markov jump process; moreover, we consider a generalized Girsanov theorem which includes the change of Poisson measure to overcome the crude discretization based on the Gr\"{o}wall's inequality and yields a much tighter error in the 2-Wasserstein ($\mathcal{W}_2$) distance. Numerically, we conduct extensive experiments and obtain the state-of-the-art results in optimization and uncertainty estimates for synthetic experiments and image data.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Accelerating Convergence of Replica Exchange Stochastic Gradient MCMC via Variance Reduction

Paper and Code