Picture for Yatin Dandi

Yatin Dandi

Online Learning and Information Exponents: On The Importance of Batch size, and Time/Complexity Tradeoffs

Add code
Jun 04, 2024
Figure 1 for Online Learning and Information Exponents: On The Importance of Batch size, and Time/Complexity Tradeoffs
Figure 2 for Online Learning and Information Exponents: On The Importance of Batch size, and Time/Complexity Tradeoffs
Figure 3 for Online Learning and Information Exponents: On The Importance of Batch size, and Time/Complexity Tradeoffs
Figure 4 for Online Learning and Information Exponents: On The Importance of Batch size, and Time/Complexity Tradeoffs
Viaarxiv icon

Repetita Iuvant: Data Repetition Allows SGD to Learn High-Dimensional Multi-Index Functions

Add code
May 24, 2024
Figure 1 for Repetita Iuvant: Data Repetition Allows SGD to Learn High-Dimensional Multi-Index Functions
Figure 2 for Repetita Iuvant: Data Repetition Allows SGD to Learn High-Dimensional Multi-Index Functions
Figure 3 for Repetita Iuvant: Data Repetition Allows SGD to Learn High-Dimensional Multi-Index Functions
Figure 4 for Repetita Iuvant: Data Repetition Allows SGD to Learn High-Dimensional Multi-Index Functions
Viaarxiv icon

Fundamental limits of weak learnability in high-dimensional multi-index models

Add code
May 24, 2024
Figure 1 for Fundamental limits of weak learnability in high-dimensional multi-index models
Figure 2 for Fundamental limits of weak learnability in high-dimensional multi-index models
Figure 3 for Fundamental limits of weak learnability in high-dimensional multi-index models
Figure 4 for Fundamental limits of weak learnability in high-dimensional multi-index models
Viaarxiv icon

Asymptotics of feature learning in two-layer networks after one gradient-step

Add code
Feb 07, 2024
Figure 1 for Asymptotics of feature learning in two-layer networks after one gradient-step
Figure 2 for Asymptotics of feature learning in two-layer networks after one gradient-step
Figure 3 for Asymptotics of feature learning in two-layer networks after one gradient-step
Figure 4 for Asymptotics of feature learning in two-layer networks after one gradient-step
Viaarxiv icon

The Benefits of Reusing Batches for Gradient Descent in Two-Layer Networks: Breaking the Curse of Information and Leap Exponents

Add code
Feb 05, 2024
Figure 1 for The Benefits of Reusing Batches for Gradient Descent in Two-Layer Networks: Breaking the Curse of Information and Leap Exponents
Figure 2 for The Benefits of Reusing Batches for Gradient Descent in Two-Layer Networks: Breaking the Curse of Information and Leap Exponents
Figure 3 for The Benefits of Reusing Batches for Gradient Descent in Two-Layer Networks: Breaking the Curse of Information and Leap Exponents
Viaarxiv icon

A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning

Add code
Sep 09, 2023
Figure 1 for A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning
Figure 2 for A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning
Figure 3 for A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning
Figure 4 for A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning
Viaarxiv icon

Sampling with flows, diffusion and autoregressive neural networks: A spin-glass perspective

Add code
Aug 27, 2023
Viaarxiv icon

Learning Two-Layer Neural Networks, One Step at a Time

Add code
May 29, 2023
Figure 1 for Learning Two-Layer Neural Networks, One  Step at a Time
Figure 2 for Learning Two-Layer Neural Networks, One  Step at a Time
Figure 3 for Learning Two-Layer Neural Networks, One  Step at a Time
Figure 4 for Learning Two-Layer Neural Networks, One  Step at a Time
Viaarxiv icon

Universality laws for Gaussian mixtures in generalized linear models

Add code
Feb 17, 2023
Figure 1 for Universality laws for Gaussian mixtures in generalized linear models
Figure 2 for Universality laws for Gaussian mixtures in generalized linear models
Figure 3 for Universality laws for Gaussian mixtures in generalized linear models
Figure 4 for Universality laws for Gaussian mixtures in generalized linear models
Viaarxiv icon

Data-heterogeneity-aware Mixing for Decentralized Learning

Add code
Apr 13, 2022
Figure 1 for Data-heterogeneity-aware Mixing for Decentralized Learning
Figure 2 for Data-heterogeneity-aware Mixing for Decentralized Learning
Figure 3 for Data-heterogeneity-aware Mixing for Decentralized Learning
Figure 4 for Data-heterogeneity-aware Mixing for Decentralized Learning
Viaarxiv icon