Picture for Hannaneh Hajishirzi

Hannaneh Hajishirzi

Shammie

Olmo Hybrid: From Theory to Practice and Back

Add code
Apr 07, 2026
Viaarxiv icon

TurnWise: The Gap between Single- and Multi-turn Language Model Capabilities

Add code
Mar 17, 2026
Viaarxiv icon

Meta-Reinforcement Learning with Self-Reflection for Agentic Search

Add code
Mar 11, 2026
Viaarxiv icon

Learning to Detect Language Model Training Data via Active Reconstruction

Add code
Feb 22, 2026
Viaarxiv icon

Small Reward Models via Backward Inference

Add code
Feb 14, 2026
Viaarxiv icon

Olmix: A Framework for Data Mixing Throughout LM Development

Add code
Feb 12, 2026
Viaarxiv icon

MentorCollab: Selective Large-to-Small Inference-Time Guidance for Efficient Reasoning

Add code
Feb 05, 2026
Viaarxiv icon

Olmo 3

Add code
Dec 15, 2025
Viaarxiv icon

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Add code
Nov 10, 2025
Figure 1 for RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Figure 2 for RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Figure 3 for RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Figure 4 for RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Viaarxiv icon

Signal and Noise: A Framework for Reducing Uncertainty in Language Model Evaluation

Add code
Aug 18, 2025
Viaarxiv icon