Alert button
Picture for Cyril Zhang

Cyril Zhang

Alert button

Can large language models explore in-context?

Add code
Bookmark button
Alert button
Mar 22, 2024
Akshay Krishnamurthy, Keegan Harris, Dylan J. Foster, Cyril Zhang, Aleksandrs Slivkins

Viaarxiv icon

Butterfly Effects of SGD Noise: Error Amplification in Behavior Cloning and Autoregression

Add code
Bookmark button
Alert button
Oct 17, 2023
Adam Block, Dylan J. Foster, Akshay Krishnamurthy, Max Simchowitz, Cyril Zhang

Viaarxiv icon

Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

Add code
Bookmark button
Alert button
Sep 07, 2023
Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang

Figure 1 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Figure 2 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Figure 3 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Figure 4 for Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck
Viaarxiv icon

Exposing Attention Glitches with Flip-Flop Language Modeling

Add code
Bookmark button
Alert button
Jun 01, 2023
Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang

Figure 1 for Exposing Attention Glitches with Flip-Flop Language Modeling
Figure 2 for Exposing Attention Glitches with Flip-Flop Language Modeling
Figure 3 for Exposing Attention Glitches with Flip-Flop Language Modeling
Figure 4 for Exposing Attention Glitches with Flip-Flop Language Modeling
Viaarxiv icon

Learning Hidden Markov Models Using Conditional Samples

Add code
Bookmark button
Alert button
Feb 28, 2023
Sham M. Kakade, Akshay Krishnamurthy, Gaurav Mahajan, Cyril Zhang

Figure 1 for Learning Hidden Markov Models Using Conditional Samples
Figure 2 for Learning Hidden Markov Models Using Conditional Samples
Viaarxiv icon

Neural Active Learning on Heteroskedastic Distributions

Add code
Bookmark button
Alert button
Nov 02, 2022
Savya Khosla, Chew Kin Whye, Jordan T. Ash, Cyril Zhang, Kenji Kawaguchi, Alex Lamb

Figure 1 for Neural Active Learning on Heteroskedastic Distributions
Figure 2 for Neural Active Learning on Heteroskedastic Distributions
Figure 3 for Neural Active Learning on Heteroskedastic Distributions
Figure 4 for Neural Active Learning on Heteroskedastic Distributions
Viaarxiv icon

Transformers Learn Shortcuts to Automata

Add code
Bookmark button
Alert button
Oct 19, 2022
Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang

Viaarxiv icon

Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms

Add code
Bookmark button
Alert button
Sep 01, 2022
Surbhi Goel, Sham Kakade, Adam Tauman Kalai, Cyril Zhang

Figure 1 for Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Figure 2 for Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Figure 3 for Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms
Viaarxiv icon

Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit

Add code
Bookmark button
Alert button
Jul 18, 2022
Boaz Barak, Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang

Figure 1 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Figure 2 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Figure 3 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Figure 4 for Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit
Viaarxiv icon

Understanding Contrastive Learning Requires Incorporating Inductive Biases

Add code
Bookmark button
Alert button
Feb 28, 2022
Nikunj Saunshi, Jordan Ash, Surbhi Goel, Dipendra Misra, Cyril Zhang, Sanjeev Arora, Sham Kakade, Akshay Krishnamurthy

Figure 1 for Understanding Contrastive Learning Requires Incorporating Inductive Biases
Figure 2 for Understanding Contrastive Learning Requires Incorporating Inductive Biases
Figure 3 for Understanding Contrastive Learning Requires Incorporating Inductive Biases
Figure 4 for Understanding Contrastive Learning Requires Incorporating Inductive Biases
Viaarxiv icon