Picture for Sham Kakade

Sham Kakade

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

Add code
Feb 22, 2024
Figure 1 for Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
Figure 2 for Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
Figure 3 for Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
Figure 4 for Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
Viaarxiv icon

A Study on the Calibration of In-context Learning

Add code
Dec 11, 2023
Viaarxiv icon

Feature emergence via margin maximization: case studies in algebraic tasks

Add code
Nov 13, 2023
Figure 1 for Feature emergence via margin maximization: case studies in algebraic tasks
Figure 2 for Feature emergence via margin maximization: case studies in algebraic tasks
Figure 3 for Feature emergence via margin maximization: case studies in algebraic tasks
Figure 4 for Feature emergence via margin maximization: case studies in algebraic tasks
Viaarxiv icon

Learning an Inventory Control Policy with General Inventory Arrival Dynamics

Add code
Oct 26, 2023
Figure 1 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Figure 2 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Figure 3 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Figure 4 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Viaarxiv icon

MatFormer: Nested Transformer for Elastic Inference

Add code
Oct 11, 2023
Figure 1 for MatFormer: Nested Transformer for Elastic Inference
Figure 2 for MatFormer: Nested Transformer for Elastic Inference
Figure 3 for MatFormer: Nested Transformer for Elastic Inference
Figure 4 for MatFormer: Nested Transformer for Elastic Inference
Viaarxiv icon

Pareto Frontiers in Neural Feature Learning: Data, Compute, Width, and Luck

Add code
Sep 07, 2023
Viaarxiv icon

Scaling Laws for Imitation Learning in NetHack

Add code
Jul 18, 2023
Viaarxiv icon

Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning

Add code
Jun 14, 2023
Viaarxiv icon

AdANNS: A Framework for Adaptive Semantic Search

Add code
May 30, 2023
Figure 1 for AdANNS: A Framework for Adaptive Semantic Search
Figure 2 for AdANNS: A Framework for Adaptive Semantic Search
Figure 3 for AdANNS: A Framework for Adaptive Semantic Search
Figure 4 for AdANNS: A Framework for Adaptive Semantic Search
Viaarxiv icon

Modified Gauss-Newton Algorithms under Noise

Add code
May 18, 2023
Figure 1 for Modified Gauss-Newton Algorithms under Noise
Figure 2 for Modified Gauss-Newton Algorithms under Noise
Figure 3 for Modified Gauss-Newton Algorithms under Noise
Figure 4 for Modified Gauss-Newton Algorithms under Noise
Viaarxiv icon