Picture for Sashank Reddi

Sashank Reddi

Structured Preconditioners in Adaptive Optimization: A Unified Analysis

Add code
Mar 13, 2025
Figure 1 for Structured Preconditioners in Adaptive Optimization: A Unified Analysis
Figure 2 for Structured Preconditioners in Adaptive Optimization: A Unified Analysis
Figure 3 for Structured Preconditioners in Adaptive Optimization: A Unified Analysis
Figure 4 for Structured Preconditioners in Adaptive Optimization: A Unified Analysis
Viaarxiv icon

Efficient Stagewise Pretraining via Progressive Subnetworks

Add code
Feb 08, 2024
Figure 1 for Efficient Stagewise Pretraining via Progressive Subnetworks
Figure 2 for Efficient Stagewise Pretraining via Progressive Subnetworks
Figure 3 for Efficient Stagewise Pretraining via Progressive Subnetworks
Figure 4 for Efficient Stagewise Pretraining via Progressive Subnetworks
Viaarxiv icon

The Inductive Bias of Flatness Regularization for Deep Matrix Factorization

Add code
Jun 22, 2023
Figure 1 for The Inductive Bias of Flatness Regularization for Deep Matrix Factorization
Figure 2 for The Inductive Bias of Flatness Regularization for Deep Matrix Factorization
Figure 3 for The Inductive Bias of Flatness Regularization for Deep Matrix Factorization
Figure 4 for The Inductive Bias of Flatness Regularization for Deep Matrix Factorization
Viaarxiv icon

FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients

Add code
Feb 16, 2022
Figure 1 for FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients
Figure 2 for FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients
Figure 3 for FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients
Figure 4 for FedLite: A Scalable Approach for Federated Learning on Resource-constrained Clients
Viaarxiv icon

Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces

Add code
May 12, 2021
Figure 1 for Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces
Figure 2 for Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces
Figure 3 for Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces
Figure 4 for Disentangling Sampling and Labeling Bias for Learning in Large-Output Spaces
Viaarxiv icon

Federated Composite Optimization

Add code
Nov 17, 2020
Figure 1 for Federated Composite Optimization
Figure 2 for Federated Composite Optimization
Figure 3 for Federated Composite Optimization
Figure 4 for Federated Composite Optimization
Viaarxiv icon

Adaptive Federated Optimization

Add code
Feb 29, 2020
Figure 1 for Adaptive Federated Optimization
Figure 2 for Adaptive Federated Optimization
Figure 3 for Adaptive Federated Optimization
Figure 4 for Adaptive Federated Optimization
Viaarxiv icon

Adaptive Sampling Distributed Stochastic Variance Reduced Gradient for Heterogeneous Distributed Datasets

Add code
Feb 20, 2020
Figure 1 for Adaptive Sampling Distributed Stochastic Variance Reduced Gradient for Heterogeneous Distributed Datasets
Figure 2 for Adaptive Sampling Distributed Stochastic Variance Reduced Gradient for Heterogeneous Distributed Datasets
Figure 3 for Adaptive Sampling Distributed Stochastic Variance Reduced Gradient for Heterogeneous Distributed Datasets
Figure 4 for Adaptive Sampling Distributed Stochastic Variance Reduced Gradient for Heterogeneous Distributed Datasets
Viaarxiv icon

Learning to Learn by Zeroth-Order Oracle

Add code
Oct 21, 2019
Figure 1 for Learning to Learn by Zeroth-Order Oracle
Figure 2 for Learning to Learn by Zeroth-Order Oracle
Figure 3 for Learning to Learn by Zeroth-Order Oracle
Figure 4 for Learning to Learn by Zeroth-Order Oracle
Viaarxiv icon

Large-scale randomized-coordinate descent methods with non-separable linear constraints

Add code
Jun 10, 2015
Figure 1 for Large-scale randomized-coordinate descent methods with non-separable linear constraints
Figure 2 for Large-scale randomized-coordinate descent methods with non-separable linear constraints
Figure 3 for Large-scale randomized-coordinate descent methods with non-separable linear constraints
Figure 4 for Large-scale randomized-coordinate descent methods with non-separable linear constraints
Viaarxiv icon