Picture for Hannaneh Hajishirzi

Hannaneh Hajishirzi

Shammie

FlexOlmo: Open Language Models for Flexible Data Use

Add code
Jul 09, 2025
Viaarxiv icon

Generalizing Verifiable Instruction Following

Add code
Jul 03, 2025
Viaarxiv icon

SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks

Add code
Jul 01, 2025
Viaarxiv icon

OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization

Add code
Jun 23, 2025
Viaarxiv icon

Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index

Add code
Jun 13, 2025
Viaarxiv icon

Spurious Rewards: Rethinking Training Signals in RLVR

Add code
Jun 12, 2025
Viaarxiv icon

ScienceMeter: Tracking Scientific Knowledge Updates in Language Models

Add code
May 30, 2025
Viaarxiv icon

Critical Batch Size Revisited: A Simple Empirical Approach to Large-Batch Language Model Training

Add code
May 29, 2025
Viaarxiv icon

ParaPO: Aligning Language Models to Reduce Verbatim Reproduction of Pre-training Data

Add code
Apr 20, 2025
Viaarxiv icon

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Add code
Apr 09, 2025
Viaarxiv icon