Picture for Yuxi Dong

Yuxi Dong

Benchmarking LLM-as-a-Judge for Long-Form Output Evaluation

Add code
Jun 01, 2026
Viaarxiv icon