Boris Hanin

Don't be lazy: CompleteP enables compute-efficient deep transformers

May 02, 2025
Via arXiv

Deep Nets as Hamiltonians

Mar 31, 2025
Via arXiv

Optimizing Model Selection for Compound AI Systems

Feb 20, 2025
Via arXiv

Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

Oct 11, 2024
Via arXiv

Networks of Networks: Complexity Class Principles Applied to Compound AI Systems Design

Jul 23, 2024
Via arXiv

Bayesian Inference with Deep Weakly Nonlinear Networks

May 26, 2024
Via arXiv

Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems

Mar 04, 2024
Via arXiv

Principled Architecture-aware Scaling of Hyperparameters

Feb 27, 2024
Via arXiv

Depthwise Hyperparameter Transfer in Residual Networks: Dynamics and Scaling Limit

Sep 28, 2023
Via arXiv

Les Houches Lectures on Deep Learning at Large & Infinite Width

Sep 08, 2023
Via arXiv