Haishan Ye

FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed

Jun 10, 2025

Towards Understanding The Calibration Benefits of Sharpness-Aware Minimization

May 29, 2025

An Enhanced Zeroth-Order Stochastic Frank-Wolfe Framework for Constrained Finite-Sum Optimization

Jan 13, 2025

Double Variance Reduction: A Smoothing Trick for Composite Optimization Problems without First-Order Gradient

May 28, 2024

Near-Optimal Distributed Minimax Optimization under the Second-Order Similarity

May 25, 2024

Second-Order Fine-Tuning without Pain for LLMs: A Hessian Informed Zeroth-Order Optimizer

Feb 23, 2024

PPFL: A Personalized Federated Learning Framework for Heterogeneous Population

Oct 22, 2023

Decentralized Riemannian Conjugate Gradient Method on the Stiefel Manifold

Aug 21, 2023

Mirror Natural Evolution Strategies

Aug 01, 2023

Stochastic Distributed Optimization under Average Second-order Similarity: Algorithms and Analysis

Apr 15, 2023