Michael Gastpar

The Conditional Regret-Capacity Theorem for Batch Universal Prediction

Aug 14, 2025
Via arXiv

What One Cannot, Two Can: Two-Layer Transformers Provably Represent Induction Heads on Any-Order Markov Chains

Aug 10, 2025
Via arXiv

Leveraging Sparsity for Sample-Efficient Preference Learning: A Theoretical Perspective

Jan 31, 2025
Via arXiv

Batch Normalization Decomposed

Dec 03, 2024
Via arXiv

Which Algorithms Have Tight Generalization Bounds?

Oct 02, 2024
Via arXiv

Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants

Aug 07, 2024
Via arXiv

Transformers on Markov Data: Constant Depth Suffices

Jul 25, 2024
Via arXiv

Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models

Jul 22, 2024
Via arXiv

Local to Global: Learning Dynamics and Effect of Initialization for Transformers

Jun 05, 2024
Via arXiv

The Fundamental Limits of Least-Privilege Learning

Feb 19, 2024
Via arXiv