Picture for Narmeen Fatimah Oozeer

Narmeen Fatimah Oozeer

The Capability Frontier: Benchmarks Miss 82% of Model Performance

Add code
Jun 25, 2026
Viaarxiv icon

Approximating Human Preferences Using a Multi-Judge Learned System

Add code
Oct 29, 2025
Viaarxiv icon