Alert button
Picture for Juhan Bae

Juhan Bae

Alert button

Can We Remove the Square-Root in Adaptive Gradient Methods? A Second-Order Perspective

Add code
Bookmark button
Alert button
Feb 13, 2024
Wu Lin, Felix Dangel, Runa Eschenhagen, Juhan Bae, Richard E. Turner, Alireza Makhzani

Viaarxiv icon

Using Large Language Models for Hyperparameter Optimization

Add code
Bookmark button
Alert button
Dec 07, 2023
Michael R. Zhang, Nishkrit Desai, Juhan Bae, Jonathan Lorraine, Jimmy Ba

Viaarxiv icon

Studying Large Language Model Generalization with Influence Functions

Add code
Bookmark button
Alert button
Aug 07, 2023
Roger Grosse, Juhan Bae, Cem Anil, Nelson Elhage, Alex Tamkin, Amirhossein Tajdini, Benoit Steiner, Dustin Li, Esin Durmus, Ethan Perez, Evan Hubinger, Kamilė Lukošiūtė, Karina Nguyen, Nicholas Joseph, Sam McCandlish, Jared Kaplan, Samuel R. Bowman

Figure 1 for Studying Large Language Model Generalization with Influence Functions
Figure 2 for Studying Large Language Model Generalization with Influence Functions
Figure 3 for Studying Large Language Model Generalization with Influence Functions
Figure 4 for Studying Large Language Model Generalization with Influence Functions
Viaarxiv icon

Benchmarking Neural Network Training Algorithms

Add code
Bookmark button
Alert button
Jun 12, 2023
George E. Dahl, Frank Schneider, Zachary Nado, Naman Agarwal, Chandramouli Shama Sastry, Philipp Hennig, Sourabh Medapati, Runa Eschenhagen, Priya Kasimbeg, Daniel Suo, Juhan Bae, Justin Gilmer, Abel L. Peirson, Bilal Khan, Rohan Anil, Mike Rabbat, Shankar Krishnan, Daniel Snider, Ehsan Amid, Kongtao Chen, Chris J. Maddison, Rakshith Vasudev, Michal Badura, Ankush Garg, Peter Mattson

Figure 1 for Benchmarking Neural Network Training Algorithms
Figure 2 for Benchmarking Neural Network Training Algorithms
Figure 3 for Benchmarking Neural Network Training Algorithms
Figure 4 for Benchmarking Neural Network Training Algorithms
Viaarxiv icon

Efficient Parametric Approximations of Neural Network Function Space Distance

Add code
Bookmark button
Alert button
Feb 07, 2023
Nikita Dhawan, Sicong Huang, Juhan Bae, Roger Grosse

Figure 1 for Efficient Parametric Approximations of Neural Network Function Space Distance
Figure 2 for Efficient Parametric Approximations of Neural Network Function Space Distance
Figure 3 for Efficient Parametric Approximations of Neural Network Function Space Distance
Figure 4 for Efficient Parametric Approximations of Neural Network Function Space Distance
Viaarxiv icon

Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve

Add code
Bookmark button
Alert button
Dec 07, 2022
Juhan Bae, Michael R. Zhang, Michael Ruan, Eric Wang, So Hasegawa, Jimmy Ba, Roger Grosse

Figure 1 for Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve
Figure 2 for Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve
Figure 3 for Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve
Figure 4 for Multi-Rate VAE: Train Once, Get the Full Rate-Distortion Curve
Viaarxiv icon

If Influence Functions are the Answer, Then What is the Question?

Add code
Bookmark button
Alert button
Sep 12, 2022
Juhan Bae, Nathan Ng, Alston Lo, Marzyeh Ghassemi, Roger Grosse

Figure 1 for If Influence Functions are the Answer, Then What is the Question?
Figure 2 for If Influence Functions are the Answer, Then What is the Question?
Figure 3 for If Influence Functions are the Answer, Then What is the Question?
Figure 4 for If Influence Functions are the Answer, Then What is the Question?
Viaarxiv icon

Amortized Proximal Optimization

Add code
Bookmark button
Alert button
Feb 28, 2022
Juhan Bae, Paul Vicol, Jeff Z. HaoChen, Roger Grosse

Figure 1 for Amortized Proximal Optimization
Figure 2 for Amortized Proximal Optimization
Figure 3 for Amortized Proximal Optimization
Figure 4 for Amortized Proximal Optimization
Viaarxiv icon

Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes

Add code
Bookmark button
Alert button
Apr 23, 2021
James Lucas, Juhan Bae, Michael R. Zhang, Stanislav Fort, Richard Zemel, Roger Grosse

Figure 1 for Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
Figure 2 for Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
Figure 3 for Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
Figure 4 for Analyzing Monotonic Linear Interpolation in Neural Network Loss Landscapes
Viaarxiv icon