Picture for Surya Ganguli

Surya Ganguli

Deriving Neural Scaling Laws from the statistics of natural language

Add code
Feb 07, 2026
Viaarxiv icon

From Kepler to Newton: Inductive Biases Guide Learned World Models in Transformers

Add code
Feb 06, 2026
Viaarxiv icon

Contrastive Concept-Tree Search for LLM-Assisted Algorithm Discovery

Add code
Feb 03, 2026
Viaarxiv icon

Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks

Add code
Jun 06, 2025
Viaarxiv icon

Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning

Add code
Feb 11, 2025
Figure 1 for Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning
Figure 2 for Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning
Figure 3 for Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning
Figure 4 for Rethinking Fine-Tuning when Scaling Test-Time Compute: Limiting Confidence Improves Mathematical Reasoning
Viaarxiv icon

An analytic theory of creativity in convolutional diffusion models

Add code
Dec 28, 2024
Figure 1 for An analytic theory of creativity in convolutional diffusion models
Figure 2 for An analytic theory of creativity in convolutional diffusion models
Figure 3 for An analytic theory of creativity in convolutional diffusion models
Figure 4 for An analytic theory of creativity in convolutional diffusion models
Viaarxiv icon

Fooling LLM graders into giving better grades through neural activity guided adversarial prompting

Add code
Dec 17, 2024
Viaarxiv icon

Features are fate: a theory of transfer learning in high-dimensional regression

Add code
Oct 10, 2024
Figure 1 for Features are fate: a theory of transfer learning in high-dimensional regression
Figure 2 for Features are fate: a theory of transfer learning in high-dimensional regression
Figure 3 for Features are fate: a theory of transfer learning in high-dimensional regression
Figure 4 for Features are fate: a theory of transfer learning in high-dimensional regression
Viaarxiv icon

Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning

Add code
Jun 10, 2024
Figure 1 for Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning
Figure 2 for Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning
Figure 3 for Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning
Figure 4 for Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning
Viaarxiv icon

Geometric Dynamics of Signal Propagation Predict Trainability of Transformers

Add code
Mar 05, 2024
Viaarxiv icon