Picture for Naomi Boneh

Naomi Boneh

Metric Match: A Subset Selection Approach to Evaluating LLM Judge Reliability

Add code
Jun 12, 2026
Viaarxiv icon