Picture for Sham Kakade

Sham Kakade

Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs

Add code
Jun 25, 2025
Viaarxiv icon

Characterization and Mitigation of Training Instabilities in Microscaling Formats

Add code
Jun 25, 2025
Viaarxiv icon

A Simplified Analysis of SGD for Linear Regression with Weight Averaging

Add code
Jun 18, 2025
Viaarxiv icon

Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning

Add code
Jun 12, 2025
Viaarxiv icon

Interpreting the Linear Structure of Vision-language Model Embedding Spaces

Add code
Apr 16, 2025
Viaarxiv icon

Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining

Add code
Apr 10, 2025
Viaarxiv icon

Data-Efficient Multi-Agent Spatial Planning with LLMs

Add code
Feb 26, 2025
Viaarxiv icon

Distributional Scaling Laws for Emergent Capabilities

Add code
Feb 24, 2025
Viaarxiv icon

Train for the Worst, Plan for the Best: Understanding Token Ordering in Masked Diffusions

Add code
Feb 10, 2025
Viaarxiv icon

Connections between Schedule-Free Optimizers, AdEMAMix, and Accelerated SGD Variants

Add code
Feb 04, 2025
Viaarxiv icon