Sham Kakade

The Potential of Second-Order Optimization for LLMs: A Study with Full Gauss-Newton

Oct 10, 2025, via arXiv

Fine-Tuning Masked Diffusion for Provable Self-Correction

Oct 01, 2025, via arXiv

Selective Underfitting in Diffusion Models

Oct 01, 2025, via arXiv

Characterization and Mitigation of Training Instabilities in Microscaling Formats

Jun 25, 2025, via arXiv

Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs

Jun 25, 2025, via arXiv

A Simplified Analysis of SGD for Linear Regression with Weight Averaging

Jun 18, 2025, via arXiv

Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning

Jun 12, 2025, via arXiv

Interpreting the Linear Structure of Vision-language Model Embedding Spaces

Apr 16, 2025, via arXiv

Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining

Apr 10, 2025, via arXiv

Data-Efficient Multi-Agent Spatial Planning with LLMs

Feb 26, 2025, via arXiv