Picture for Shravan Nayak

Shravan Nayak

Discovering Failure Modes in Vision-Language Models using RL

Add code
Apr 06, 2026
Viaarxiv icon

CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents

Add code
Mar 25, 2026
Viaarxiv icon

EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings

Add code
Mar 13, 2026
Viaarxiv icon

LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs

Add code
Jan 31, 2026
Viaarxiv icon

Grammar Search for Multi-Agent Systems

Add code
Dec 16, 2025
Figure 1 for Grammar Search for Multi-Agent Systems
Figure 2 for Grammar Search for Multi-Agent Systems
Figure 3 for Grammar Search for Multi-Agent Systems
Figure 4 for Grammar Search for Multi-Agent Systems
Viaarxiv icon

Grounding Computer Use Agents on Human Demonstrations

Add code
Nov 10, 2025
Figure 1 for Grounding Computer Use Agents on Human Demonstrations
Figure 2 for Grounding Computer Use Agents on Human Demonstrations
Figure 3 for Grounding Computer Use Agents on Human Demonstrations
Figure 4 for Grounding Computer Use Agents on Human Demonstrations
Viaarxiv icon

Value Drifts: Tracing Value Alignment During LLM Post-Training

Add code
Oct 30, 2025
Viaarxiv icon

CulturalFrames: Assessing Cultural Expectation Alignment in Text-to-Image Models and Evaluation Metrics

Add code
Jun 10, 2025
Viaarxiv icon

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Add code
Mar 19, 2025
Viaarxiv icon

Exploiting Domain-Specific Parallel Data on Multilingual Language Models for Low-resource Language Translation

Add code
Dec 27, 2024
Viaarxiv icon