Picture for Thomas Pethick

Thomas Pethick

When to use what Schatten-$p$ norm in deep learning?

Add code
Jun 13, 2026
Viaarxiv icon

Free Heavy-Tailed Lunch for Muon: A Theoretical Justification of Empirical Success

Add code
Jun 12, 2026
Viaarxiv icon

Training Neural Networks at Any Scale

Add code
Nov 14, 2025
Viaarxiv icon

Training Deep Learning Models with Norm-Constrained LMOs

Add code
Feb 11, 2025
Viaarxiv icon

SAMPa: Sharpness-aware Minimization Parallelized

Add code
Oct 14, 2024
Figure 1 for SAMPa: Sharpness-aware Minimization Parallelized
Figure 2 for SAMPa: Sharpness-aware Minimization Parallelized
Figure 3 for SAMPa: Sharpness-aware Minimization Parallelized
Figure 4 for SAMPa: Sharpness-aware Minimization Parallelized
Viaarxiv icon

Improving SAM Requires Rethinking its Optimization Formulation

Add code
Jul 17, 2024
Figure 1 for Improving SAM Requires Rethinking its Optimization Formulation
Figure 2 for Improving SAM Requires Rethinking its Optimization Formulation
Figure 3 for Improving SAM Requires Rethinking its Optimization Formulation
Figure 4 for Improving SAM Requires Rethinking its Optimization Formulation
Viaarxiv icon

Stable Nonconvex-Nonconcave Training via Linear Interpolation

Add code
Oct 20, 2023
Viaarxiv icon

Federated Learning under Covariate Shifts with Generalization Guarantees

Add code
Jun 08, 2023
Viaarxiv icon

Escaping limit cycles: Global convergence for constrained nonconvex-nonconcave minimax problems

Add code
Feb 20, 2023
Figure 1 for Escaping limit cycles: Global convergence for constrained nonconvex-nonconcave minimax problems
Figure 2 for Escaping limit cycles: Global convergence for constrained nonconvex-nonconcave minimax problems
Figure 3 for Escaping limit cycles: Global convergence for constrained nonconvex-nonconcave minimax problems
Figure 4 for Escaping limit cycles: Global convergence for constrained nonconvex-nonconcave minimax problems
Viaarxiv icon

Revisiting adversarial training for the worst-performing class

Add code
Feb 17, 2023
Viaarxiv icon