Picture for Derek Dunfield

Derek Dunfield

APRES: An Agentic Paper Revision and Evaluation System

Add code
Mar 03, 2026
Viaarxiv icon

AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents

Add code
Feb 09, 2026
Viaarxiv icon

AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

Add code
Jul 03, 2025
Viaarxiv icon