Alert button
Picture for Ben Adlam

Ben Adlam

Alert button

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Add code
Bookmark button
Alert button
Dec 22, 2023
Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel

Figure 1 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 2 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 3 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 4 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Viaarxiv icon

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

Add code
Bookmark button
Alert button
Nov 15, 2023
C. Daniel Freeman, Laura Culp, Aaron Parisi, Maxwell L Bileschi, Gamaleldin F Elsayed, Alex Rizkowsky, Isabelle Simpson, Alex Alemi, Azade Nova, Ben Adlam, Bernd Bohnet, Gaurav Mishra, Hanie Sedghi, Igor Mordatch, Izzeddin Gur, Jaehoon Lee, JD Co-Reyes, Jeffrey Pennington, Kelvin Xu, Kevin Swersky, Kshiteej Mahajan, Lechao Xiao, Rosanne Liu, Simon Kornblith, Noah Constant, Peter J. Liu, Roman Novak, Yundi Qian, Noah Fiedel, Jascha Sohl-Dickstein

Viaarxiv icon

Small-scale proxies for large-scale Transformer training instabilities

Add code
Bookmark button
Alert button
Sep 25, 2023
Mitchell Wortsman, Peter J. Liu, Lechao Xiao, Katie Everett, Alex Alemi, Ben Adlam, John D. Co-Reyes, Izzeddin Gur, Abhishek Kumar, Roman Novak, Jeffrey Pennington, Jascha Sohl-dickstein, Kelvin Xu, Jaehoon Lee, Justin Gilmer, Simon Kornblith

Figure 1 for Small-scale proxies for large-scale Transformer training instabilities
Figure 2 for Small-scale proxies for large-scale Transformer training instabilities
Figure 3 for Small-scale proxies for large-scale Transformer training instabilities
Figure 4 for Small-scale proxies for large-scale Transformer training instabilities
Viaarxiv icon

Kernel Regression with Infinite-Width Neural Networks on Millions of Examples

Add code
Bookmark button
Alert button
Mar 09, 2023
Ben Adlam, Jaehoon Lee, Shreyas Padhy, Zachary Nado, Jasper Snoek

Figure 1 for Kernel Regression with Infinite-Width Neural Networks on Millions of Examples
Figure 2 for Kernel Regression with Infinite-Width Neural Networks on Millions of Examples
Figure 3 for Kernel Regression with Infinite-Width Neural Networks on Millions of Examples
Figure 4 for Kernel Regression with Infinite-Width Neural Networks on Millions of Examples
Viaarxiv icon

Ensembling over Classifiers: a Bias-Variance Perspective

Add code
Bookmark button
Alert button
Jun 21, 2022
Neha Gupta, Jamie Smith, Ben Adlam, Zelda Mariet

Figure 1 for Ensembling over Classifiers: a Bias-Variance Perspective
Figure 2 for Ensembling over Classifiers: a Bias-Variance Perspective
Figure 3 for Ensembling over Classifiers: a Bias-Variance Perspective
Figure 4 for Ensembling over Classifiers: a Bias-Variance Perspective
Viaarxiv icon

Implicit Regularization or Implicit Conditioning? Exact Risk Trajectories of SGD in High Dimensions

Add code
Bookmark button
Alert button
Jun 15, 2022
Courtney Paquette, Elliot Paquette, Ben Adlam, Jeffrey Pennington

Figure 1 for Implicit Regularization or Implicit Conditioning? Exact Risk Trajectories of SGD in High Dimensions
Figure 2 for Implicit Regularization or Implicit Conditioning? Exact Risk Trajectories of SGD in High Dimensions
Figure 3 for Implicit Regularization or Implicit Conditioning? Exact Risk Trajectories of SGD in High Dimensions
Figure 4 for Implicit Regularization or Implicit Conditioning? Exact Risk Trajectories of SGD in High Dimensions
Viaarxiv icon

Homogenization of SGD in high-dimensions: Exact dynamics and generalization properties

Add code
Bookmark button
Alert button
May 14, 2022
Courtney Paquette, Elliot Paquette, Ben Adlam, Jeffrey Pennington

Figure 1 for Homogenization of SGD in high-dimensions: Exact dynamics and generalization properties
Figure 2 for Homogenization of SGD in high-dimensions: Exact dynamics and generalization properties
Figure 3 for Homogenization of SGD in high-dimensions: Exact dynamics and generalization properties
Figure 4 for Homogenization of SGD in high-dimensions: Exact dynamics and generalization properties
Viaarxiv icon

Understanding the bias-variance tradeoff of Bregman divergences

Add code
Bookmark button
Alert button
Feb 10, 2022
Ben Adlam, Neha Gupta, Zelda Mariet, Jamie Smith

Figure 1 for Understanding the bias-variance tradeoff of Bregman divergences
Figure 2 for Understanding the bias-variance tradeoff of Bregman divergences
Viaarxiv icon

Covariate Shift in High-Dimensional Random Feature Regression

Add code
Bookmark button
Alert button
Nov 16, 2021
Nilesh Tripuraneni, Ben Adlam, Jeffrey Pennington

Figure 1 for Covariate Shift in High-Dimensional Random Feature Regression
Figure 2 for Covariate Shift in High-Dimensional Random Feature Regression
Figure 3 for Covariate Shift in High-Dimensional Random Feature Regression
Figure 4 for Covariate Shift in High-Dimensional Random Feature Regression
Viaarxiv icon

Underspecification Presents Challenges for Credibility in Modern Machine Learning

Add code
Bookmark button
Alert button
Nov 06, 2020
Alexander D'Amour, Katherine Heller, Dan Moldovan, Ben Adlam, Babak Alipanahi, Alex Beutel, Christina Chen, Jonathan Deaton, Jacob Eisenstein, Matthew D. Hoffman, Farhad Hormozdiari, Neil Houlsby, Shaobo Hou, Ghassen Jerfel, Alan Karthikesalingam, Mario Lucic, Yian Ma, Cory McLean, Diana Mincu, Akinori Mitani, Andrea Montanari, Zachary Nado, Vivek Natarajan, Christopher Nielson, Thomas F. Osborne, Rajiv Raman, Kim Ramasamy, Rory Sayres, Jessica Schrouff, Martin Seneviratne, Shannon Sequeira, Harini Suresh, Victor Veitch, Max Vladymyrov, Xuezhi Wang, Kellie Webster, Steve Yadlowsky, Taedong Yun, Xiaohua Zhai, D. Sculley

Figure 1 for Underspecification Presents Challenges for Credibility in Modern Machine Learning
Figure 2 for Underspecification Presents Challenges for Credibility in Modern Machine Learning
Figure 3 for Underspecification Presents Challenges for Credibility in Modern Machine Learning
Figure 4 for Underspecification Presents Challenges for Credibility in Modern Machine Learning
Viaarxiv icon