Picture for Colin Fraser

Colin Fraser

Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic

Add code
Dec 08, 2025
Figure 1 for Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic
Figure 2 for Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic
Figure 3 for Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic
Figure 4 for Balanced Accuracy: The Right Metric for Evaluating LLM Judges -- Explained through Youden's J statistic
Viaarxiv icon