Harsha Vardhan Khurdula

The Structured Output Benchmark: A Multi-Source Benchmark for Evaluating Structured Output Quality in Large Language Models

Apr 28, 2026
Via arXiv

Interfaze: The Future of AI is built on Task-Specific Small Models

Feb 04, 2026
Via arXiv

Beyond Visual Understanding: Introducing PARROT-360V for Vision Language Model Benchmarking

Nov 20, 2024
Via arXiv