Alert button
Picture for Surya Ganguli

Surya Ganguli

Alert button

Geometric Dynamics of Signal Propagation Predict Trainability of Transformers

Add code
Bookmark button
Alert button
Mar 05, 2024
Aditya Cowsik, Tamra Nebabu, Xiao-Liang Qi, Surya Ganguli

Figure 1 for Geometric Dynamics of Signal Propagation Predict Trainability of Transformers
Figure 2 for Geometric Dynamics of Signal Propagation Predict Trainability of Transformers
Figure 3 for Geometric Dynamics of Signal Propagation Predict Trainability of Transformers
Figure 4 for Geometric Dynamics of Signal Propagation Predict Trainability of Transformers
Viaarxiv icon

Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression

Add code
Bookmark button
Alert button
Jun 26, 2023
Allan Raventós, Mansheej Paul, Feng Chen, Surya Ganguli

Figure 1 for Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression
Figure 2 for Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression
Figure 3 for Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression
Figure 4 for Pretraining task diversity and the emergence of non-Bayesian in-context learning for regression
Viaarxiv icon

Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks

Add code
Bookmark button
Alert button
Jun 07, 2023
Feng Chen, Daniel Kunin, Atsushi Yamamura, Surya Ganguli

Figure 1 for Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks
Figure 2 for Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks
Figure 3 for Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks
Figure 4 for Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks
Viaarxiv icon

SemDeDup: Data-efficient learning at web-scale through semantic deduplication

Add code
Bookmark button
Alert button
Mar 22, 2023
Amro Abbas, Kushal Tirumala, Dániel Simig, Surya Ganguli, Ari S. Morcos

Figure 1 for SemDeDup: Data-efficient learning at web-scale through semantic deduplication
Figure 2 for SemDeDup: Data-efficient learning at web-scale through semantic deduplication
Figure 3 for SemDeDup: Data-efficient learning at web-scale through semantic deduplication
Figure 4 for SemDeDup: Data-efficient learning at web-scale through semantic deduplication
Viaarxiv icon

Holistic Evaluation of Language Models

Add code
Bookmark button
Alert button
Nov 16, 2022
Percy Liang, Rishi Bommasani, Tony Lee, Dimitris Tsipras, Dilara Soylu, Michihiro Yasunaga, Yian Zhang, Deepak Narayanan, Yuhuai Wu, Ananya Kumar, Benjamin Newman, Binhang Yuan, Bobby Yan, Ce Zhang, Christian Cosgrove, Christopher D. Manning, Christopher Ré, Diana Acosta-Navas, Drew A. Hudson, Eric Zelikman, Esin Durmus, Faisal Ladhak, Frieda Rong, Hongyu Ren, Huaxiu Yao, Jue Wang, Keshav Santhanam, Laurel Orr, Lucia Zheng, Mert Yuksekgonul, Mirac Suzgun, Nathan Kim, Neel Guha, Niladri Chatterji, Omar Khattab, Peter Henderson, Qian Huang, Ryan Chi, Sang Michael Xie, Shibani Santurkar, Surya Ganguli, Tatsunori Hashimoto, Thomas Icard, Tianyi Zhang, Vishrav Chaudhary, William Wang, Xuechen Li, Yifan Mai, Yuhui Zhang, Yuta Koreeda

Figure 1 for Holistic Evaluation of Language Models
Figure 2 for Holistic Evaluation of Language Models
Figure 3 for Holistic Evaluation of Language Models
Figure 4 for Holistic Evaluation of Language Models
Viaarxiv icon

Toward Next-Generation Artificial Intelligence: Catalyzing the NeuroAI Revolution

Add code
Bookmark button
Alert button
Oct 15, 2022
Anthony Zador, Blake Richards, Bence Ölveczky, Sean Escola, Yoshua Bengio, Kwabena Boahen, Matthew Botvinick, Dmitri Chklovskii, Anne Churchland, Claudia Clopath, James DiCarlo, Surya Ganguli, Jeff Hawkins, Konrad Koerding, Alexei Koulakov, Yann LeCun, Timothy Lillicrap, Adam Marblestone, Bruno Olshausen, Alexandre Pouget, Cristina Savin, Terrence Sejnowski, Eero Simoncelli, Sara Solla, David Sussillo, Andreas S. Tolias, Doris Tsao

Viaarxiv icon

What does a deep neural network confidently perceive? The effective dimension of high certainty class manifolds and their low confidence boundaries

Add code
Bookmark button
Alert button
Oct 11, 2022
Stanislav Fort, Ekin Dogus Cubuk, Surya Ganguli, Samuel S. Schoenholz

Figure 1 for What does a deep neural network confidently perceive? The effective dimension of high certainty class manifolds and their low confidence boundaries
Figure 2 for What does a deep neural network confidently perceive? The effective dimension of high certainty class manifolds and their low confidence boundaries
Figure 3 for What does a deep neural network confidently perceive? The effective dimension of high certainty class manifolds and their low confidence boundaries
Figure 4 for What does a deep neural network confidently perceive? The effective dimension of high certainty class manifolds and their low confidence boundaries
Viaarxiv icon

The Asymmetric Maximum Margin Bias of Quasi-Homogeneous Neural Networks

Add code
Bookmark button
Alert button
Oct 07, 2022
Daniel Kunin, Atsushi Yamamura, Chao Ma, Surya Ganguli

Figure 1 for The Asymmetric Maximum Margin Bias of Quasi-Homogeneous Neural Networks
Figure 2 for The Asymmetric Maximum Margin Bias of Quasi-Homogeneous Neural Networks
Figure 3 for The Asymmetric Maximum Margin Bias of Quasi-Homogeneous Neural Networks
Viaarxiv icon

Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?

Add code
Bookmark button
Alert button
Oct 06, 2022
Mansheej Paul, Feng Chen, Brett W. Larsen, Jonathan Frankle, Surya Ganguli, Gintare Karolina Dziugaite

Figure 1 for Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?
Figure 2 for Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?
Figure 3 for Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?
Figure 4 for Unmasking the Lottery Ticket Hypothesis: What's Encoded in a Winning Ticket's Mask?
Viaarxiv icon