Sam Bowyer

Efficient Benchmarking Is Just Feature Selection and Multiple Regression

May 25, 2026
Via arXiv

Learning Generation Orders for Masked Discrete Diffusion Models via Variational Inference

Feb 27, 2026
Via arXiv

Massively Parallel Expectation Maximization For Approximate Posteriors

Mar 11, 2025
Via arXiv

Position: Don't use the CLT in LLM evals with fewer than a few hundred datapoints

Mar 04, 2025
Via arXiv