Picture for Abhishek Chandwani

Abhishek Chandwani

Beyond Binary Correctness: Scaling Evaluation of Long-Horizon Agents on Subjective Enterprise Tasks

Add code
Mar 24, 2026
Viaarxiv icon

Understanding Virality: A Rubric based Vision-Language Model Framework for Short-Form Edutainment Evaluation

Add code
Dec 24, 2025
Viaarxiv icon