Picture for Amy Heineike

Amy Heineike

A Framework for Evaluating Agentic Skills at Scale

Add code
Jun 16, 2026
Viaarxiv icon

Position: Coding Benchmarks Are Misaligned with Agentic Software Engineering

Add code
Jun 16, 2026
Viaarxiv icon