Boris Hanin

Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems

Mar 04, 2024
Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei Zaharia, James Zou

Via arXiv

Principled Architecture-aware Scaling of Hyperparameters

Feb 27, 2024
Wuyang Chen, Junru Wu, Zhangyang Wang, Boris Hanin

Via arXiv

Depthwise Hyperparameter Transfer in Residual Networks: Dynamics and Scaling Limit

Sep 28, 2023
Blake Bordelon, Lorenzo Noci, Mufan Bill Li, Boris Hanin, Cengiz Pehlevan

Via arXiv

Les Houches Lectures on Deep Learning at Large & Infinite Width

Sep 08, 2023
Yasaman Bahri, Boris Hanin, Antonin Brossollet, Vittorio Erba, Christian Keup, Rosalba Pacelli, James B. Simon

Via arXiv

Quantitative CLTs in Deep Neural Networks

Jul 21, 2023
Stefano Favaro, Boris Hanin, Domenico Marinucci, Ivan Nourdin, Giovanni Peccati

Via arXiv

Principles for Initialization and Architecture Selection in Graph Neural Networks with ReLU Activations

Jun 20, 2023
Gage DeZoort, Boris Hanin

Via arXiv

Depth Dependence of $μ$P Learning Rates in ReLU MLPs

May 13, 2023
Samy Jelassi, Boris Hanin, Ziwei Ji, Sashank J. Reddi, Srinadh Bhojanapalli, Sanjiv Kumar

Via arXiv

Bayesian Interpolation with Deep Linear Networks

Jan 02, 2023
Boris Hanin, Alexander Zlokapa

Via arXiv