Picture for Peter Richtárik

Peter Richtárik

King Abdullah University of Science and Technology

Drop-Muon: Update Less, Converge Faster

Add code
Oct 02, 2025
Viaarxiv icon

Error Feedback for Muon and Friends

Add code
Oct 01, 2025
Viaarxiv icon

Non-Euclidean Broximal Point Method: A Blueprint for Geometry-Aware Optimization

Add code
Oct 01, 2025
Viaarxiv icon

Gluon: Making Muon & Scion Great Again! (Bridging Theory and Practice of LMO-based Optimizers for LLMs)

Add code
May 19, 2025
Viaarxiv icon

BurTorch: Revisiting Training from First Principles by Coupling Autodiff, Math Optimization, and Systems

Add code
Mar 18, 2025
Viaarxiv icon

Smoothed Normalization for Efficient Distributed Private Optimization

Add code
Feb 19, 2025
Viaarxiv icon

A Novel Unified Parametric Assumption for Nonconvex Optimization

Add code
Feb 17, 2025
Viaarxiv icon

The Ball-Proximal (="Broximal") Point Method: a New Algorithm, Convergence Theory, and Applications

Add code
Feb 04, 2025
Figure 1 for The Ball-Proximal (="Broximal") Point Method: a New Algorithm, Convergence Theory, and Applications
Figure 2 for The Ball-Proximal (="Broximal") Point Method: a New Algorithm, Convergence Theory, and Applications
Figure 3 for The Ball-Proximal (="Broximal") Point Method: a New Algorithm, Convergence Theory, and Applications
Figure 4 for The Ball-Proximal (="Broximal") Point Method: a New Algorithm, Convergence Theory, and Applications
Viaarxiv icon

ATA: Adaptive Task Allocation for Efficient Resource Management in Distributed Machine Learning

Add code
Feb 02, 2025
Viaarxiv icon

Symmetric Pruning of Large Language Models

Add code
Jan 31, 2025
Viaarxiv icon