Picture for Boris Hanin

Boris Hanin

Bayesian Inference with Deep Weakly Nonlinear Networks

May 26, 2024
Viaarxiv icon

Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems

Mar 04, 2024
Figure 1 for Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Figure 2 for Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Figure 3 for Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Figure 4 for Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems
Viaarxiv icon

Principled Architecture-aware Scaling of Hyperparameters

Add code
Feb 27, 2024
Viaarxiv icon

Depthwise Hyperparameter Transfer in Residual Networks: Dynamics and Scaling Limit

Add code
Sep 28, 2023
Viaarxiv icon

Les Houches Lectures on Deep Learning at Large & Infinite Width

Sep 08, 2023
Figure 1 for Les Houches Lectures on Deep Learning at Large & Infinite Width
Figure 2 for Les Houches Lectures on Deep Learning at Large & Infinite Width
Viaarxiv icon

Quantitative CLTs in Deep Neural Networks

Jul 21, 2023
Viaarxiv icon

Principles for Initialization and Architecture Selection in Graph Neural Networks with ReLU Activations

Jun 20, 2023
Figure 1 for Principles for Initialization and Architecture Selection in Graph Neural Networks with ReLU Activations
Figure 2 for Principles for Initialization and Architecture Selection in Graph Neural Networks with ReLU Activations
Figure 3 for Principles for Initialization and Architecture Selection in Graph Neural Networks with ReLU Activations
Figure 4 for Principles for Initialization and Architecture Selection in Graph Neural Networks with ReLU Activations
Viaarxiv icon

Depth Dependence of $μ$P Learning Rates in ReLU MLPs

May 13, 2023
Viaarxiv icon

Bayesian Interpolation with Deep Linear Networks

Jan 02, 2023
Figure 1 for Bayesian Interpolation with Deep Linear Networks
Figure 2 for Bayesian Interpolation with Deep Linear Networks
Figure 3 for Bayesian Interpolation with Deep Linear Networks
Viaarxiv icon

Maximal Initial Learning Rates in Deep ReLU Networks

Dec 14, 2022
Figure 1 for Maximal Initial Learning Rates in Deep ReLU Networks
Figure 2 for Maximal Initial Learning Rates in Deep ReLU Networks
Figure 3 for Maximal Initial Learning Rates in Deep ReLU Networks
Figure 4 for Maximal Initial Learning Rates in Deep ReLU Networks
Viaarxiv icon