Evan Miller

Adding Error Bars to Evals: A Statistical Approach to Language Model Evaluations

Nov 01, 2024 (via arXiv)

Learning Abduction under Partial Observability

Nov 25, 2017 (via arXiv)