Picture for Andrey Veprikov

Andrey Veprikov

Where Does Warm-Up Come From? Adaptive Scheduling for Norm-Constrained Optimizers

Add code
Feb 05, 2026
Viaarxiv icon

DyKAF: Dynamical Kronecker Approximation of the Fisher Information Matrix for Gradient Preconditioning

Add code
Nov 09, 2025
Figure 1 for DyKAF: Dynamical Kronecker Approximation of the Fisher Information Matrix for Gradient Preconditioning
Figure 2 for DyKAF: Dynamical Kronecker Approximation of the Fisher Information Matrix for Gradient Preconditioning
Figure 3 for DyKAF: Dynamical Kronecker Approximation of the Fisher Information Matrix for Gradient Preconditioning
Figure 4 for DyKAF: Dynamical Kronecker Approximation of the Fisher Information Matrix for Gradient Preconditioning
Viaarxiv icon

Leveraging Coordinate Momentum in SignSGD and Muon: Memory-Optimized Zero-Order

Add code
Jun 04, 2025
Viaarxiv icon

A Mathematical Model of the Hidden Feedback Loop Effect in Machine Learning Systems

Add code
May 04, 2024
Viaarxiv icon