Picture for Peter Kruger

Peter Kruger

AutoBench: Automating LLM Evaluation through Reciprocal Peer Assessment

Add code
Oct 26, 2025
Viaarxiv icon