Ryan Holbrook

Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation

May 01, 2025
Via arXiv