Abubakarr Jaye

CORPGEN: Simulating Corporate Environments with Autonomous Digital Employees in Multi-Horizon Task Environments

Feb 15, 2026
Via arXiv

Auto-Eval Judge: Towards a General Agentic Framework for Task Completion Evaluation

Aug 07, 2025
Via arXiv