Alert button
Picture for Andrey Gromov

Andrey Gromov

Alert button

Is Model Collapse Inevitable? Breaking the Curse of Recursion by Accumulating Real and Synthetic Data

Add code
Bookmark button
Alert button
Apr 01, 2024
Matthias Gerstgrasser, Rylan Schaeffer, Apratim Dey, Rafael Rafailov, Henry Sleight, John Hughes, Tomasz Korbak, Rajashree Agrawal, Dhruv Pai, Andrey Gromov, Daniel A. Roberts, Diyi Yang, David L. Donoho, Sanmi Koyejo

Viaarxiv icon

The Unreasonable Ineffectiveness of the Deeper Layers

Add code
Bookmark button
Alert button
Mar 26, 2024
Andrey Gromov, Kushal Tirumala, Hassan Shapourian, Paolo Glorioso, Daniel A. Roberts

Viaarxiv icon

Bridging Associative Memory and Probabilistic Modeling

Add code
Bookmark button
Alert button
Feb 15, 2024
Rylan Schaeffer, Nika Zahedi, Mikail Khona, Dhruv Pai, Sang Truong, Yilun Du, Mitchell Ostrow, Sarthak Chandra, Andres Carranza, Ila Rani Fiete, Andrey Gromov, Sanmi Koyejo

Viaarxiv icon

To grok or not to grok: Disentangling generalization and memorization on corrupted algorithmic datasets

Add code
Bookmark button
Alert button
Oct 19, 2023
Darshil Doshi, Aritra Das, Tianyu He, Andrey Gromov

Viaarxiv icon

Grokking modular arithmetic

Add code
Bookmark button
Alert button
Jan 06, 2023
Andrey Gromov

Figure 1 for Grokking modular arithmetic
Figure 2 for Grokking modular arithmetic
Figure 3 for Grokking modular arithmetic
Figure 4 for Grokking modular arithmetic
Viaarxiv icon

AutoInit: Automatic Initialization via Jacobian Tuning

Add code
Bookmark button
Alert button
Jun 27, 2022
Tianyu He, Darshil Doshi, Andrey Gromov

Figure 1 for AutoInit: Automatic Initialization via Jacobian Tuning
Figure 2 for AutoInit: Automatic Initialization via Jacobian Tuning
Figure 3 for AutoInit: Automatic Initialization via Jacobian Tuning
Figure 4 for AutoInit: Automatic Initialization via Jacobian Tuning
Viaarxiv icon

Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm

Add code
Bookmark button
Alert button
Nov 30, 2021
Darshil Doshi, Tianyu He, Andrey Gromov

Figure 1 for Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm
Figure 2 for Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm
Figure 3 for Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm
Figure 4 for Critical initialization of wide and deep neural networks through partial Jacobians: general theory and applications to LayerNorm
Viaarxiv icon