Justin Tang

Permutation-Consensus Listwise Judging for Robust Factuality Evaluation

Mar 20, 2026
Via arXiv