Picture for Greg Yang

Greg Yang

A Spectral Condition for Feature Learning

Add code
Oct 26, 2023
Figure 1 for A Spectral Condition for Feature Learning
Figure 2 for A Spectral Condition for Feature Learning
Figure 3 for A Spectral Condition for Feature Learning
Figure 4 for A Spectral Condition for Feature Learning
Viaarxiv icon

Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks

Add code
Oct 12, 2023
Figure 1 for Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks
Figure 2 for Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks
Figure 3 for Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks
Figure 4 for Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks
Viaarxiv icon

Tensor Programs IVb: Adaptive Optimization in the Infinite-Width Limit

Add code
Aug 07, 2023
Figure 1 for Tensor Programs IVb: Adaptive Optimization in the Infinite-Width Limit
Figure 2 for Tensor Programs IVb: Adaptive Optimization in the Infinite-Width Limit
Figure 3 for Tensor Programs IVb: Adaptive Optimization in the Infinite-Width Limit
Figure 4 for Tensor Programs IVb: Adaptive Optimization in the Infinite-Width Limit
Viaarxiv icon

Width and Depth Limits Commute in Residual Networks

Add code
Feb 01, 2023
Figure 1 for Width and Depth Limits Commute in Residual Networks
Figure 2 for Width and Depth Limits Commute in Residual Networks
Figure 3 for Width and Depth Limits Commute in Residual Networks
Figure 4 for Width and Depth Limits Commute in Residual Networks
Viaarxiv icon

High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation

Add code
May 03, 2022
Figure 1 for High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation
Figure 2 for High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation
Figure 3 for High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation
Figure 4 for High-dimensional Asymptotics of Feature Learning: How One Gradient Step Improves the Representation
Viaarxiv icon

Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer

Add code
Mar 28, 2022
Figure 1 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 2 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 3 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Figure 4 for Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer
Viaarxiv icon

CLUES: Few-Shot Learning Evaluation in Natural Language Understanding

Add code
Nov 04, 2021
Figure 1 for CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
Figure 2 for CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
Figure 3 for CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
Figure 4 for CLUES: Few-Shot Learning Evaluation in Natural Language Understanding
Viaarxiv icon

Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks

Add code
Jul 02, 2021
Figure 1 for Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks
Figure 2 for Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks
Figure 3 for Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks
Figure 4 for Implicit Acceleration and Feature Learning in Infinitely Wide Neural Networks with Bottlenecks
Viaarxiv icon

3DB: A Framework for Debugging Computer Vision Models

Add code
Jun 07, 2021
Figure 1 for 3DB: A Framework for Debugging Computer Vision Models
Figure 2 for 3DB: A Framework for Debugging Computer Vision Models
Figure 3 for 3DB: A Framework for Debugging Computer Vision Models
Figure 4 for 3DB: A Framework for Debugging Computer Vision Models
Viaarxiv icon

Tensor Programs IIb: Architectural Universality of Neural Tangent Kernel Training Dynamics

Add code
May 08, 2021
Figure 1 for Tensor Programs IIb: Architectural Universality of Neural Tangent Kernel Training Dynamics
Figure 2 for Tensor Programs IIb: Architectural Universality of Neural Tangent Kernel Training Dynamics
Figure 3 for Tensor Programs IIb: Architectural Universality of Neural Tangent Kernel Training Dynamics
Figure 4 for Tensor Programs IIb: Architectural Universality of Neural Tangent Kernel Training Dynamics
Viaarxiv icon