Michael Gastpar

Local to Global: Learning Dynamics and Effect of Initialization for Transformers

Jun 05, 2024

The Fundamental Limits of Least-Privilege Learning

Feb 19, 2024

Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains

Feb 06, 2024

Batch Universal Prediction

Feb 06, 2024

Fantastic Generalization Measures are Nowhere to be Found

Sep 24, 2023

Lower Bounds on the Bayesian Risk via Information Measures

Mar 24, 2023

Asymptotically Optimal Generalization Error Bounds for Noisy, Iterative Algorithms

Feb 28, 2023

Finite Littlestone Dimension Implies Finite Information Complexity

Jun 27, 2022

From Generalisation Error to Transportation-cost Inequalities and Back

Feb 08, 2022

A Johnson–Lindenstrauss Framework for Randomly Initialized CNNs

Nov 03, 2021