Picture for Jonathan Berant

Jonathan Berant

Shammie

MT-PingEval: Evaluating Multi-Turn Collaboration with Private Information Games

Add code
Feb 27, 2026
Viaarxiv icon

Comparing human and language models sentence processing difficulties on complex structures

Add code
Oct 08, 2025
Viaarxiv icon

Cost-Optimal Active AI Model Evaluation

Add code
Jun 09, 2025
Viaarxiv icon

Don't lie to your friends: Learning what you know from collaborative self-play

Add code
Mar 18, 2025
Viaarxiv icon

When the LM misunderstood the human chuckled: Analyzing garden path effects in humans and language models

Add code
Feb 13, 2025
Viaarxiv icon

InfAlign: Inference-aware language model alignment

Add code
Dec 27, 2024
Viaarxiv icon

ALTA: Compiler-Based Analysis of Transformers

Add code
Oct 23, 2024
Figure 1 for ALTA: Compiler-Based Analysis of Transformers
Figure 2 for ALTA: Compiler-Based Analysis of Transformers
Figure 3 for ALTA: Compiler-Based Analysis of Transformers
Figure 4 for ALTA: Compiler-Based Analysis of Transformers
Viaarxiv icon

Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning

Add code
Oct 10, 2024
Figure 1 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 2 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 3 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Figure 4 for Rewarding Progress: Scaling Automated Process Verifiers for LLM Reasoning
Viaarxiv icon

AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?

Add code
Jul 22, 2024
Viaarxiv icon

From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty

Add code
Jul 08, 2024
Figure 1 for From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
Figure 2 for From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
Figure 3 for From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
Figure 4 for From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
Viaarxiv icon