Peter Bartlett

Hard labels sampled from sparse targets mislead rotation invariant algorithms

Mar 21, 2026

Training Dynamics of Softmax Self-Attention: Fast Global Convergence via Preconditioning

Mar 02, 2026

Benefits of Early Stopping in Gradient Descent for Overparameterized Logistic Regression

Feb 18, 2025

Fast Best-of-N Decoding via Speculative Rejection

Oct 26, 2024

FutureFill: Fast Generation from Convolutional Sequence Models

Oct 02, 2024

Implicit Diffusion: Efficient Optimization through Stochastic Sampling

Feb 08, 2024

Contextual Bandits with Stage-wise Constraints

Jan 15, 2024

Can a Transformer Represent a Kalman Filter?

Dec 14, 2023

Joint Representation Training in Sequential Tasks with Shared Structure

Jun 24, 2022

Generalization Bounds for Data-Driven Numerical Linear Algebra

Jun 16, 2022