Picture for Mahisha Ramesh

Mahisha Ramesh

Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems

Add code
Dec 16, 2025
Viaarxiv icon