Alert button
Picture for Lechao Xiao

Lechao Xiao

Alert button

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Add code
Bookmark button
Alert button
Dec 22, 2023
Avi Singh, John D. Co-Reyes, Rishabh Agarwal, Ankesh Anand, Piyush Patil, Xavier Garcia, Peter J. Liu, James Harrison, Jaehoon Lee, Kelvin Xu, Aaron Parisi, Abhishek Kumar, Alex Alemi, Alex Rizkowsky, Azade Nova, Ben Adlam, Bernd Bohnet, Gamaleldin Elsayed, Hanie Sedghi, Igor Mordatch, Isabelle Simpson, Izzeddin Gur, Jasper Snoek, Jeffrey Pennington, Jiri Hron, Kathleen Kenealy, Kevin Swersky, Kshiteej Mahajan, Laura Culp, Lechao Xiao, Maxwell L. Bileschi, Noah Constant, Roman Novak, Rosanne Liu, Tris Warkentin, Yundi Qian, Yamini Bansal, Ethan Dyer, Behnam Neyshabur, Jascha Sohl-Dickstein, Noah Fiedel

Figure 1 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 2 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 3 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 4 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Viaarxiv icon

Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

Add code
Bookmark button
Alert button
Nov 15, 2023
C. Daniel Freeman, Laura Culp, Aaron Parisi, Maxwell L Bileschi, Gamaleldin F Elsayed, Alex Rizkowsky, Isabelle Simpson, Alex Alemi, Azade Nova, Ben Adlam, Bernd Bohnet, Gaurav Mishra, Hanie Sedghi, Igor Mordatch, Izzeddin Gur, Jaehoon Lee, JD Co-Reyes, Jeffrey Pennington, Kelvin Xu, Kevin Swersky, Kshiteej Mahajan, Lechao Xiao, Rosanne Liu, Simon Kornblith, Noah Constant, Peter J. Liu, Roman Novak, Yundi Qian, Noah Fiedel, Jascha Sohl-Dickstein

Viaarxiv icon

Small-scale proxies for large-scale Transformer training instabilities

Add code
Bookmark button
Alert button
Sep 25, 2023
Mitchell Wortsman, Peter J. Liu, Lechao Xiao, Katie Everett, Alex Alemi, Ben Adlam, John D. Co-Reyes, Izzeddin Gur, Abhishek Kumar, Roman Novak, Jeffrey Pennington, Jascha Sohl-dickstein, Kelvin Xu, Jaehoon Lee, Justin Gilmer, Simon Kornblith

Figure 1 for Small-scale proxies for large-scale Transformer training instabilities
Figure 2 for Small-scale proxies for large-scale Transformer training instabilities
Figure 3 for Small-scale proxies for large-scale Transformer training instabilities
Figure 4 for Small-scale proxies for large-scale Transformer training instabilities
Viaarxiv icon

Fast Neural Kernel Embeddings for General Activations

Add code
Bookmark button
Alert button
Sep 09, 2022
Insu Han, Amir Zandieh, Jaehoon Lee, Roman Novak, Lechao Xiao, Amin Karbasi

Figure 1 for Fast Neural Kernel Embeddings for General Activations
Figure 2 for Fast Neural Kernel Embeddings for General Activations
Figure 3 for Fast Neural Kernel Embeddings for General Activations
Figure 4 for Fast Neural Kernel Embeddings for General Activations
Viaarxiv icon

Synergy and Symmetry in Deep Learning: Interactions between the Data, Model, and Inference Algorithm

Add code
Bookmark button
Alert button
Jul 11, 2022
Lechao Xiao, Jeffrey Pennington

Figure 1 for Synergy and Symmetry in Deep Learning: Interactions between the Data, Model, and Inference Algorithm
Figure 2 for Synergy and Symmetry in Deep Learning: Interactions between the Data, Model, and Inference Algorithm
Figure 3 for Synergy and Symmetry in Deep Learning: Interactions between the Data, Model, and Inference Algorithm
Figure 4 for Synergy and Symmetry in Deep Learning: Interactions between the Data, Model, and Inference Algorithm
Viaarxiv icon

Precise Learning Curves and Higher-Order Scaling Limits for Dot Product Kernel Regression

Add code
Bookmark button
Alert button
May 30, 2022
Lechao Xiao, Jeffrey Pennington

Figure 1 for Precise Learning Curves and Higher-Order Scaling Limits for Dot Product Kernel Regression
Figure 2 for Precise Learning Curves and Higher-Order Scaling Limits for Dot Product Kernel Regression
Figure 3 for Precise Learning Curves and Higher-Order Scaling Limits for Dot Product Kernel Regression
Figure 4 for Precise Learning Curves and Higher-Order Scaling Limits for Dot Product Kernel Regression
Viaarxiv icon

Eigenspace Restructuring: a Principle of Space and Frequency in Neural Networks

Add code
Bookmark button
Alert button
Dec 10, 2021
Lechao Xiao

Figure 1 for Eigenspace Restructuring: a Principle of Space and Frequency in Neural Networks
Figure 2 for Eigenspace Restructuring: a Principle of Space and Frequency in Neural Networks
Figure 3 for Eigenspace Restructuring: a Principle of Space and Frequency in Neural Networks
Figure 4 for Eigenspace Restructuring: a Principle of Space and Frequency in Neural Networks
Viaarxiv icon

Dataset Distillation with Infinitely Wide Convolutional Networks

Add code
Bookmark button
Alert button
Jul 27, 2021
Timothy Nguyen, Roman Novak, Lechao Xiao, Jaehoon Lee

Figure 1 for Dataset Distillation with Infinitely Wide Convolutional Networks
Figure 2 for Dataset Distillation with Infinitely Wide Convolutional Networks
Figure 3 for Dataset Distillation with Infinitely Wide Convolutional Networks
Figure 4 for Dataset Distillation with Infinitely Wide Convolutional Networks
Viaarxiv icon

Exploring the Uncertainty Properties of Neural Networks' Implicit Priors in the Infinite-Width Limit

Add code
Bookmark button
Alert button
Oct 14, 2020
Ben Adlam, Jaehoon Lee, Lechao Xiao, Jeffrey Pennington, Jasper Snoek

Figure 1 for Exploring the Uncertainty Properties of Neural Networks' Implicit Priors in the Infinite-Width Limit
Figure 2 for Exploring the Uncertainty Properties of Neural Networks' Implicit Priors in the Infinite-Width Limit
Figure 3 for Exploring the Uncertainty Properties of Neural Networks' Implicit Priors in the Infinite-Width Limit
Figure 4 for Exploring the Uncertainty Properties of Neural Networks' Implicit Priors in the Infinite-Width Limit
Viaarxiv icon

Finite Versus Infinite Neural Networks: an Empirical Study

Add code
Bookmark button
Alert button
Sep 08, 2020
Jaehoon Lee, Samuel S. Schoenholz, Jeffrey Pennington, Ben Adlam, Lechao Xiao, Roman Novak, Jascha Sohl-Dickstein

Figure 1 for Finite Versus Infinite Neural Networks: an Empirical Study
Figure 2 for Finite Versus Infinite Neural Networks: an Empirical Study
Figure 3 for Finite Versus Infinite Neural Networks: an Empirical Study
Figure 4 for Finite Versus Infinite Neural Networks: an Empirical Study
Viaarxiv icon