Christopher Zanoli

ELT-Bench-Verified: Benchmark Quality Issues Underestimate AI Agent Capabilities

Apr 02, 2026
Via arXiv