Alert button
Picture for Suriya Gunasekar

Suriya Gunasekar

Alert button

KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval

Add code
Bookmark button
Alert button
Oct 24, 2023
Marah I Abdin, Suriya Gunasekar, Varun Chandrasekaran, Jerry Li, Mert Yuksekgonul, Rahee Ghosh Peshawaria, Ranjita Naik, Besmira Nushi

Viaarxiv icon

Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models

Add code
Bookmark button
Alert button
Sep 26, 2023
Mert Yuksekgonul, Varun Chandrasekaran, Erik Jones, Suriya Gunasekar, Ranjita Naik, Hamid Palangi, Ece Kamar, Besmira Nushi

Figure 1 for Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
Figure 2 for Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
Figure 3 for Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
Figure 4 for Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
Viaarxiv icon

Textbooks Are All You Need II: phi-1.5 technical report

Add code
Bookmark button
Alert button
Sep 11, 2023
Yuanzhi Li, Sébastien Bubeck, Ronen Eldan, Allie Del Giorno, Suriya Gunasekar, Yin Tat Lee

Figure 1 for Textbooks Are All You Need II: phi-1.5 technical report
Figure 2 for Textbooks Are All You Need II: phi-1.5 technical report
Figure 3 for Textbooks Are All You Need II: phi-1.5 technical report
Figure 4 for Textbooks Are All You Need II: phi-1.5 technical report
Viaarxiv icon

Textbooks Are All You Need

Add code
Bookmark button
Alert button
Jun 20, 2023
Suriya Gunasekar, Yi Zhang, Jyoti Aneja, Caio César Teodoro Mendes, Allie Del Giorno, Sivakanth Gopi, Mojan Javaheripi, Piero Kauffmann, Gustavo de Rosa, Olli Saarikivi, Adil Salim, Shital Shah, Harkirat Singh Behl, Xin Wang, Sébastien Bubeck, Ronen Eldan, Adam Tauman Kalai, Yin Tat Lee, Yuanzhi Li

Figure 1 for Textbooks Are All You Need
Figure 2 for Textbooks Are All You Need
Figure 3 for Textbooks Are All You Need
Figure 4 for Textbooks Are All You Need
Viaarxiv icon

(S)GD over Diagonal Linear Networks: Implicit Regularisation, Large Stepsizes and Edge of Stability

Add code
Bookmark button
Alert button
Feb 17, 2023
Mathieu Even, Scott Pesme, Suriya Gunasekar, Nicolas Flammarion

Figure 1 for (S)GD over Diagonal Linear Networks: Implicit Regularisation, Large Stepsizes and Edge of Stability
Figure 2 for (S)GD over Diagonal Linear Networks: Implicit Regularisation, Large Stepsizes and Edge of Stability
Figure 3 for (S)GD over Diagonal Linear Networks: Implicit Regularisation, Large Stepsizes and Edge of Stability
Figure 4 for (S)GD over Diagonal Linear Networks: Implicit Regularisation, Large Stepsizes and Edge of Stability
Viaarxiv icon

How to Fine-Tune Vision Models with SGD

Add code
Bookmark button
Alert button
Nov 17, 2022
Ananya Kumar, Ruoqi Shen, Sébastien Bubeck, Suriya Gunasekar

Figure 1 for How to Fine-Tune Vision Models with SGD
Figure 2 for How to Fine-Tune Vision Models with SGD
Figure 3 for How to Fine-Tune Vision Models with SGD
Figure 4 for How to Fine-Tune Vision Models with SGD
Viaarxiv icon

Neural-Sim: Learning to Generate Training Data with NeRF

Add code
Bookmark button
Alert button
Jul 22, 2022
Yunhao Ge, Harkirat Behl, Jiashu Xu, Suriya Gunasekar, Neel Joshi, Yale Song, Xin Wang, Laurent Itti, Vibhav Vineet

Figure 1 for Neural-Sim: Learning to Generate Training Data with NeRF
Figure 2 for Neural-Sim: Learning to Generate Training Data with NeRF
Figure 3 for Neural-Sim: Learning to Generate Training Data with NeRF
Figure 4 for Neural-Sim: Learning to Generate Training Data with NeRF
Viaarxiv icon

Generalization to translation shifts: a study in architectures and augmentations

Add code
Bookmark button
Alert button
Jul 05, 2022
Suriya Gunasekar

Figure 1 for Generalization to translation shifts: a study in architectures and augmentations
Figure 2 for Generalization to translation shifts: a study in architectures and augmentations
Figure 3 for Generalization to translation shifts: a study in architectures and augmentations
Figure 4 for Generalization to translation shifts: a study in architectures and augmentations
Viaarxiv icon

Unveiling Transformers with LEGO: a synthetic reasoning task

Add code
Bookmark button
Alert button
Jun 09, 2022
Yi Zhang, Arturs Backurs, Sébastien Bubeck, Ronen Eldan, Suriya Gunasekar, Tal Wagner

Figure 1 for Unveiling Transformers with LEGO: a synthetic reasoning task
Figure 2 for Unveiling Transformers with LEGO: a synthetic reasoning task
Figure 3 for Unveiling Transformers with LEGO: a synthetic reasoning task
Figure 4 for Unveiling Transformers with LEGO: a synthetic reasoning task
Viaarxiv icon

Data Augmentation as Feature Manipulation: a story of desert cows and grass cows

Add code
Bookmark button
Alert button
Mar 03, 2022
Ruoqi Shen, Sébastien Bubeck, Suriya Gunasekar

Figure 1 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Figure 2 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Figure 3 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Figure 4 for Data Augmentation as Feature Manipulation: a story of desert cows and grass cows
Viaarxiv icon