Picture for Marc Finzi

Marc Finzi

From Entropy to Epiplexity: Rethinking Information for Computationally Bounded Intelligence

Add code
Jan 06, 2026
Viaarxiv icon

Compute-Optimal LLMs Provably Generalize Better With Scale

Add code
Apr 21, 2025
Figure 1 for Compute-Optimal LLMs Provably Generalize Better With Scale
Figure 2 for Compute-Optimal LLMs Provably Generalize Better With Scale
Figure 3 for Compute-Optimal LLMs Provably Generalize Better With Scale
Figure 4 for Compute-Optimal LLMs Provably Generalize Better With Scale
Viaarxiv icon

Antidistillation Sampling

Add code
Apr 17, 2025
Viaarxiv icon

Predicting the Performance of Black-box LLMs through Self-Queries

Add code
Jan 02, 2025
Figure 1 for Predicting the Performance of Black-box LLMs through Self-Queries
Figure 2 for Predicting the Performance of Black-box LLMs through Self-Queries
Figure 3 for Predicting the Performance of Black-box LLMs through Self-Queries
Figure 4 for Predicting the Performance of Black-box LLMs through Self-Queries
Viaarxiv icon

Diffusing Differentiable Representations

Add code
Dec 09, 2024
Viaarxiv icon

Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices

Add code
Oct 03, 2024
Figure 1 for Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices
Figure 2 for Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices
Figure 3 for Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices
Figure 4 for Searching for Efficient Linear Layers over a Continuous Space of Structured Matrices
Viaarxiv icon

Unlocking Tokens as Data Points for Generalization Bounds on Larger Language Models

Add code
Jul 25, 2024
Viaarxiv icon

Compute Better Spent: Replacing Dense Layers with Structured Matrices

Add code
Jun 10, 2024
Figure 1 for Compute Better Spent: Replacing Dense Layers with Structured Matrices
Figure 2 for Compute Better Spent: Replacing Dense Layers with Structured Matrices
Figure 3 for Compute Better Spent: Replacing Dense Layers with Structured Matrices
Figure 4 for Compute Better Spent: Replacing Dense Layers with Structured Matrices
Viaarxiv icon

Non-Vacuous Generalization Bounds for Large Language Models

Add code
Dec 28, 2023
Viaarxiv icon

Large Language Models Are Zero-Shot Time Series Forecasters

Add code
Oct 11, 2023
Viaarxiv icon