Sham Kakade

A New Perspective on Shampoo's Preconditioner

Jun 25, 2024

DataComp-LM: In search of the next generation of training sets for language models

Jun 18, 2024

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

Jun 15, 2024

Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems

Feb 27, 2024

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

Feb 22, 2024

A Study on the Calibration of In-context Learning

Dec 11, 2023

Feature emergence via margin maximization: case studies in algebraic tasks

Nov 13, 2023

Learning an Inventory Control Policy with General Inventory Arrival Dynamics

Oct 26, 2023

MatFormer: Nested Transformer for Elastic Inference

Oct 11, 2023

Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

Sep 07, 2023