Sanjeev Arora

Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates

Feb 28, 2024
Kaifeng Lyu, Haoyu Zhao, Xinran Gu, Dingli Yu, Anirudh Goyal, Sanjeev Arora

LESS: Selecting Influential Data for Targeted Instruction Tuning

Feb 20, 2024
Mengzhou Xia, Sadhika Malladi, Suchin Gururangan, Sanjeev Arora, Danqi Chen

Language Models as Science Tutors

Feb 16, 2024
Alexis Chevalier, Jiayi Geng, Alexander Wettig, Howard Chen, Sebastian Mizera, Toni Annala, Max Jameson Aragon, Arturo Rodríguez Fanlo, Simon Frieder, Simon Machado, Akshara Prabhakar, Ellie Thieu, Jiachen T. Wang, Zirui Wang, Xindi Wu, Mengzhou Xia, Wenhan Jia, Jiatong Yu, Jun-Jie Zhu, Zhiyong Jason Ren, Sanjeev Arora, Danqi Chen

Unlearning via Sparse Representations

Nov 26, 2023
Vedant Shah, Frederik Träuble, Ashish Malik, Hugo Larochelle, Michael Mozer, Sanjeev Arora, Yoshua Bengio, Anirudh Goyal

Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models

Oct 26, 2023
Dingli Yu, Simran Kaur, Arushi Gupta, Jonah Brown-Cohen, Anirudh Goyal, Sanjeev Arora

A Quadratic Synchronization Rule for Distributed Deep Learning

Oct 22, 2023
Xinran Gu, Kaifeng Lyu, Sanjeev Arora, Jingzhao Zhang, Longbo Huang

A Theory for Emergence of Complex Skills in Language Models

Jul 29, 2023
Sanjeev Arora, Anirudh Goyal

Trainable Transformer in Transformer

Jul 03, 2023
Abhishek Panigrahi, Sadhika Malladi, Mengzhou Xia, Sanjeev Arora

Fine-Tuning Language Models with Just Forward Passes

May 27, 2023
Sadhika Malladi, Tianyu Gao, Eshaan Nichani, Alex Damian, Jason D. Lee, Danqi Chen, Sanjeev Arora
