Picture for Sizhe Dang

Sizhe Dang

From $O(mn)$ to $O(r^2)$: Two-Sided Low-Rank Communication for Adam in Distributed Training with Memory Efficiency

Add code
Feb 08, 2026
Viaarxiv icon

ESSAM: A Novel Competitive Evolution Strategies Approach to Reinforcement Learning for Memory Efficient LLMs Fine-Tuning

Add code
Feb 01, 2026
Viaarxiv icon

FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed

Add code
Jun 10, 2025
Viaarxiv icon

Less is more: Embracing sparsity and interpolation with Esiformer for time series forecasting

Add code
Oct 08, 2024
Viaarxiv icon

Second-Order Fine-Tuning without Pain for LLMs:A Hessian Informed Zeroth-Order Optimizer

Add code
Feb 23, 2024
Viaarxiv icon

Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition

Add code
Dec 04, 2023
Viaarxiv icon