Picture for Srinivas Sunkara

Srinivas Sunkara

JD

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

Add code
May 13, 2026
Viaarxiv icon

Do Enterprise Systems Need Learned World Models? The Importance of Context to Infer Dynamics

Add code
May 12, 2026
Viaarxiv icon

Super Apriel: One Checkpoint, Many Speeds

Add code
Apr 21, 2026
Viaarxiv icon

Terminal Agents Suffice for Enterprise Automation

Add code
Mar 31, 2026
Viaarxiv icon

Ego2Web: A Web Agent Benchmark Grounded in Egocentric Videos

Add code
Mar 23, 2026
Viaarxiv icon

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Add code
Mar 13, 2026
Viaarxiv icon

AprielGuard

Add code
Dec 23, 2025
Figure 1 for AprielGuard
Figure 2 for AprielGuard
Figure 3 for AprielGuard
Figure 4 for AprielGuard
Viaarxiv icon

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Add code
Dec 05, 2024
Figure 1 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 2 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 3 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Figure 4 for BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks
Viaarxiv icon

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Add code
Feb 19, 2024
Figure 1 for ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Figure 2 for ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Figure 3 for ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Figure 4 for ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Viaarxiv icon

Towards Better Semantic Understanding of Mobile Interfaces

Add code
Oct 06, 2022
Figure 1 for Towards Better Semantic Understanding of Mobile Interfaces
Figure 2 for Towards Better Semantic Understanding of Mobile Interfaces
Figure 3 for Towards Better Semantic Understanding of Mobile Interfaces
Figure 4 for Towards Better Semantic Understanding of Mobile Interfaces
Viaarxiv icon