Youran Dong

Efficient Curvature-Aware Hypergradient Approximation for Bilevel Optimization

May 04, 2025

Youran Dong, Junfeng Yang, Wei Yao, Jin Zhang

Abstract: Bilevel optimization is a powerful tool for many machine learning problems, such as hyperparameter optimization and meta-learning. Estimating hypergradients (also known as implicit gradients) is crucial for developing gradient-based methods for bilevel optimization. In this work, we propose a computationally efficient technique for incorporating curvature information into the approximation of hypergradients and present a novel algorithmic framework based on the resulting enhanced hypergradient computation. We provide convergence rate guarantees for the proposed framework in both deterministic and stochastic scenarios, particularly showing improved computational complexity over popular gradient-based methods in the deterministic setting. This improvement in complexity arises from a careful exploitation of the hypergradient structure and the inexact Newton method. In addition to the theoretical speedup, numerical experiments demonstrate the significant practical performance benefits of incorporating curvature information.

* Accepted by ICML 2025
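
For readers unfamiliar with the term, the following is a minimal sketch of what a hypergradient is, using the standard implicit-function-theorem formula on a toy quadratic problem. It is not the curvature-aware method proposed in the paper. The problem instance and all function names (`conjugate_gradient`, `hypergradient`, `upper_objective`) are assumptions chosen for illustration: the lower-level objective is g(x, y) = 0.5 yᵀAy − xᵀy with A symmetric positive definite, and the upper-level objective is f(x, y) = 0.5 ‖y − b‖².

```python
# Toy bilevel problem (illustrative assumption, not taken from the paper):
#   lower level: y*(x) = argmin_y 0.5 * y^T A y - x^T y   =>  A y*(x) = x
#   upper level: F(x)  = f(x, y*(x)) = 0.5 * ||y*(x) - b||^2
# Implicit-function-theorem hypergradient:
#   grad F(x) = grad_x f - grad_xy g * [grad_yy g]^{-1} * grad_y f
# Here grad_x f = 0, grad_xy g = -I, grad_yy g = A, grad_y f = y* - b,
# so grad F(x) = A^{-1} (y*(x) - b).

def matvec(A, v):
    """Multiply a dense matrix (list of rows) by a vector."""
    return [sum(a * x for a, x in zip(row, v)) for row in A]

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def conjugate_gradient(A, rhs, iters=50, tol=1e-12):
    """Solve A z = rhs for symmetric positive definite A using only
    matrix-vector products (no explicit inverse)."""
    z = [0.0] * len(rhs)
    r = list(rhs)            # residual rhs - A z, with z = 0
    p = list(r)              # search direction
    rs = dot(r, r)
    for _ in range(iters):
        if rs < tol:         # squared residual norm small enough
            break
        Ap = matvec(A, p)
        alpha = rs / dot(p, Ap)
        z = [zi + alpha * pi for zi, pi in zip(z, p)]
        r = [ri - alpha * api for ri, api in zip(r, Ap)]
        rs_new = dot(r, r)
        p = [ri + (rs_new / rs) * pi for ri, pi in zip(r, p)]
        rs = rs_new
    return z

def upper_objective(A, b, x):
    """F(x) = 0.5 * ||y*(x) - b||^2, with y*(x) obtained by solving A y = x."""
    y_star = conjugate_gradient(A, x)
    return 0.5 * sum((yi - bi) ** 2 for yi, bi in zip(y_star, b))

def hypergradient(A, b, x):
    """Hypergradient of F at x for the toy problem above."""
    y_star = conjugate_gradient(A, x)                    # lower-level solve
    grad_y_f = [yi - bi for yi, bi in zip(y_star, b)]    # grad_y f at y*
    # Linear solve with the lower-level Hessian A; the sign from
    # grad_xy g = -I cancels the leading minus in the formula.
    return conjugate_gradient(A, grad_y_f)
```

Calling `hypergradient([[2.0, 0.5], [0.5, 1.0]], [1.0, -1.0], [0.3, 0.7])` returns the gradient of the upper-level objective with respect to `x`, which can be checked against finite differences of `upper_objective`. The linear solve with the lower-level Hessian is the costly step in larger problems, which is why efficient approximations of it matter.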

A Single-Loop Algorithm for Decentralized Bilevel Optimization

Nov 15, 2023

Youran Dong, Shiqian Ma, Junfeng Yang, Chao Yin

Abstract: Bilevel optimization has received growing attention recently due to its wide applications in machine learning. In this paper, we consider bilevel optimization in decentralized networks. In particular, we propose a novel single-loop algorithm for solving decentralized bilevel optimization with a strongly convex lower-level problem. Our algorithm is fully single-loop and does not require heavy matrix-vector multiplications when approximating the hypergradient. Moreover, unlike existing methods for decentralized bilevel optimization and federated bilevel optimization, our algorithm does not require any gradient heterogeneity assumption. Our analysis shows that the proposed algorithm achieves the best-known convergence rate for bilevel optimization algorithms.
