Picture for Shuhua Yu

Shuhua Yu

RMNP: Row-Momentum Normalized Preconditioning for Scalable Matrix-Based Optimization

Add code
Mar 20, 2026
Viaarxiv icon

HTMuon: Improving Muon via Heavy-Tailed Spectral Correction

Add code
Mar 10, 2026
Viaarxiv icon

Distributed Sign Momentum with Local Steps for Training Transformers

Add code
Nov 26, 2024
Viaarxiv icon

Large Deviations and Improved Mean-squared Error Rates of Nonlinear SGD: Heavy-tailed Noise and Power of Symmetry

Add code
Oct 21, 2024
Figure 1 for Large Deviations and Improved Mean-squared Error Rates of Nonlinear SGD: Heavy-tailed Noise and Power of Symmetry
Figure 2 for Large Deviations and Improved Mean-squared Error Rates of Nonlinear SGD: Heavy-tailed Noise and Power of Symmetry
Viaarxiv icon

Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees

Add code
Oct 17, 2024
Figure 1 for Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees
Figure 2 for Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees
Figure 3 for Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees
Figure 4 for Nonlinear Stochastic Gradient Descent and Heavy-tailed Noise: A Unified Framework and High-probability Guarantees
Viaarxiv icon

Revisiting Image Classifier Training for Improved Certified Robust Defense against Adversarial Patches

Add code
Jun 22, 2023
Figure 1 for Revisiting Image Classifier Training for Improved Certified Robust Defense against Adversarial Patches
Figure 2 for Revisiting Image Classifier Training for Improved Certified Robust Defense against Adversarial Patches
Figure 3 for Revisiting Image Classifier Training for Improved Certified Robust Defense against Adversarial Patches
Figure 4 for Revisiting Image Classifier Training for Improved Certified Robust Defense against Adversarial Patches
Viaarxiv icon