Hannaneh Hajishirzi

Shammie

ScienceMeter: Tracking Scientific Knowledge Updates in Language Models

May 30, 2025
Via arXiv

Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training

May 29, 2025
Via arXiv

ParaPO: Aligning Language Models to Reduce Verbatim Reproduction of Pre-training Data

Apr 20, 2025
Via arXiv

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Apr 09, 2025
Via arXiv

Steering off Course: Reliability Challenges in Steering Language Models

Apr 06, 2025
Via arXiv

EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees

Mar 11, 2025
Via arXiv

s1: Simple test-time scaling

Jan 31, 2025
Via arXiv

2 OLMo 2 Furious

Dec 31, 2024
Via arXiv

HREF: Human Response-Guided Evaluation of Instruction Following in Language Models

Dec 20, 2024
Via arXiv

A Systematic Examination of Preference Learning through the Lens of Instruction-Following

Dec 18, 2024
Via arXiv