Picture for Jacob Steinhardt

Jacob Steinhardt

Establishing Best Practices for Building Rigorous Agentic Benchmarks

Add code
Jul 03, 2025
Viaarxiv icon

Understanding In-context Learning of Addition via Activation Subspaces

Add code
May 08, 2025
Viaarxiv icon

Uncovering Gaps in How Humans and LLMs Interpret Subjective Language

Add code
Mar 06, 2025
Figure 1 for Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
Figure 2 for Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
Figure 3 for Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
Figure 4 for Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
Viaarxiv icon

Which Attention Heads Matter for In-Context Learning?

Add code
Feb 19, 2025
Viaarxiv icon

Eliciting Language Model Behaviors with Investigator Agents

Add code
Feb 03, 2025
Viaarxiv icon

Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision

Add code
Jan 14, 2025
Viaarxiv icon

LatentQA: Teaching LLMs to Decode Activations Into Natural Language

Add code
Dec 11, 2024
Viaarxiv icon

Extractive Structures Learned in Pretraining Enable Generalization on Finetuned Facts

Add code
Dec 05, 2024
Viaarxiv icon

What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?

Add code
Nov 12, 2024
Figure 1 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 2 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 3 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Figure 4 for What Do Learning Dynamics Reveal About Generalization in LLM Reasoning?
Viaarxiv icon

VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models

Add code
Oct 10, 2024
Viaarxiv icon