Picture for Rustem Islamov

Rustem Islamov

On the Role of Batch Size in Stochastic Conditional Gradient Methods

Add code
Mar 22, 2026
Viaarxiv icon

Non-Euclidean Gradient Descent Operates at the Edge of Stability

Add code
Mar 05, 2026
Viaarxiv icon

Adaptive Methods Are Preferable in High Privacy Settings: An SDE Perspective

Add code
Mar 03, 2026
Viaarxiv icon

Enhancing Optimizer Stability: Momentum Adaptation of The NGN Step-size

Add code
Aug 20, 2025
Viaarxiv icon

Safe-EF: Error Feedback for Nonsmooth Constrained Optimization

Add code
May 09, 2025
Viaarxiv icon

Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs

Add code
Feb 24, 2025
Figure 1 for Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs
Figure 2 for Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs
Figure 3 for Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs
Figure 4 for Unbiased and Sign Compression in Distributed Learning: Comparing Noise Resilience via SDEs
Viaarxiv icon

Double Momentum and Error Feedback for Clipping with Fast Rates and Differential Privacy

Add code
Feb 17, 2025
Figure 1 for Double Momentum and Error Feedback for Clipping with Fast Rates and Differential Privacy
Figure 2 for Double Momentum and Error Feedback for Clipping with Fast Rates and Differential Privacy
Figure 3 for Double Momentum and Error Feedback for Clipping with Fast Rates and Differential Privacy
Figure 4 for Double Momentum and Error Feedback for Clipping with Fast Rates and Differential Privacy
Viaarxiv icon

Adaptive Methods through the Lens of SDEs: Theoretical Insights on the Role of Noise

Add code
Nov 24, 2024
Viaarxiv icon

Loss Landscape Characterization of Neural Networks without Over-Parametrization

Add code
Oct 17, 2024
Figure 1 for Loss Landscape Characterization of Neural Networks without Over-Parametrization
Figure 2 for Loss Landscape Characterization of Neural Networks without Over-Parametrization
Figure 3 for Loss Landscape Characterization of Neural Networks without Over-Parametrization
Figure 4 for Loss Landscape Characterization of Neural Networks without Over-Parametrization
Viaarxiv icon

Near Optimal Decentralized Optimization with Compression and Momentum Tracking

Add code
May 30, 2024
Figure 1 for Near Optimal Decentralized Optimization with Compression and Momentum Tracking
Figure 2 for Near Optimal Decentralized Optimization with Compression and Momentum Tracking
Figure 3 for Near Optimal Decentralized Optimization with Compression and Momentum Tracking
Figure 4 for Near Optimal Decentralized Optimization with Compression and Momentum Tracking
Viaarxiv icon