Picture for Roshita Bhonsle

Roshita Bhonsle

Auto-Eval Judge: Towards a General Agentic Framework for Task Completion Evaluation

Add code
Aug 07, 2025
Viaarxiv icon