Picture for Feihu Huang

Feihu Huang

HomeAdam: Adam and AdamW Algorithms Sometimes Go Home to Obtain Better Provable Generalization

Add code
Mar 03, 2026
Viaarxiv icon

LiMuon: Light and Fast Muon Optimizer for Large Models

Add code
Sep 19, 2025
Figure 1 for LiMuon: Light and Fast Muon Optimizer for Large Models
Figure 2 for LiMuon: Light and Fast Muon Optimizer for Large Models
Figure 3 for LiMuon: Light and Fast Muon Optimizer for Large Models
Figure 4 for LiMuon: Light and Fast Muon Optimizer for Large Models
Viaarxiv icon

Faster Adaptive Decentralized Learning Algorithms

Add code
Aug 19, 2024
Figure 1 for Faster Adaptive Decentralized Learning Algorithms
Figure 2 for Faster Adaptive Decentralized Learning Algorithms
Figure 3 for Faster Adaptive Decentralized Learning Algorithms
Figure 4 for Faster Adaptive Decentralized Learning Algorithms
Viaarxiv icon

Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization

Add code
Jul 25, 2024
Figure 1 for Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization
Figure 2 for Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization
Figure 3 for Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization
Figure 4 for Optimal Hessian/Jacobian-Free Nonconvex-PL Bilevel Optimization
Viaarxiv icon

Adaptive Mirror Descent Bilevel Optimization

Add code
Nov 18, 2023
Figure 1 for Adaptive Mirror Descent Bilevel Optimization
Viaarxiv icon

Near-Optimal Decentralized Momentum Method for Nonconvex-PL Minimax Problems

Add code
Apr 21, 2023
Figure 1 for Near-Optimal Decentralized Momentum Method for Nonconvex-PL Minimax Problems
Viaarxiv icon

Enhanced Adaptive Gradient Algorithms for Nonconvex-PL Minimax Optimization

Add code
Mar 13, 2023
Figure 1 for Enhanced Adaptive Gradient Algorithms for Nonconvex-PL Minimax Optimization
Viaarxiv icon

On Momentum-Based Gradient Methods for Bilevel Optimization with Nonconvex Lower-Level

Add code
Mar 07, 2023
Figure 1 for On Momentum-Based Gradient Methods for Bilevel Optimization with Nonconvex Lower-Level
Figure 2 for On Momentum-Based Gradient Methods for Bilevel Optimization with Nonconvex Lower-Level
Figure 3 for On Momentum-Based Gradient Methods for Bilevel Optimization with Nonconvex Lower-Level
Figure 4 for On Momentum-Based Gradient Methods for Bilevel Optimization with Nonconvex Lower-Level
Viaarxiv icon

Communication-Efficient Federated Bilevel Optimization with Local and Global Lower Level Problems

Add code
Feb 13, 2023
Figure 1 for Communication-Efficient Federated Bilevel Optimization with Local and Global Lower Level Problems
Figure 2 for Communication-Efficient Federated Bilevel Optimization with Local and Global Lower Level Problems
Figure 3 for Communication-Efficient Federated Bilevel Optimization with Local and Global Lower Level Problems
Figure 4 for Communication-Efficient Federated Bilevel Optimization with Local and Global Lower Level Problems
Viaarxiv icon

FedDA: Faster Framework of Local Adaptive Gradient Methods via Restarted Dual Averaging

Add code
Feb 13, 2023
Figure 1 for FedDA: Faster Framework of Local Adaptive Gradient Methods via Restarted Dual Averaging
Figure 2 for FedDA: Faster Framework of Local Adaptive Gradient Methods via Restarted Dual Averaging
Figure 3 for FedDA: Faster Framework of Local Adaptive Gradient Methods via Restarted Dual Averaging
Figure 4 for FedDA: Faster Framework of Local Adaptive Gradient Methods via Restarted Dual Averaging
Viaarxiv icon