Picture for Siva Reddy

Siva Reddy

AgentBeats: Agentifying Agent Assessment for Openness, Standardization, and Reproducibility

Add code
Jun 11, 2026
Viaarxiv icon

Would you still call this Dax? Novel Visual References in VLMs and Humans

Add code
Jun 03, 2026
Viaarxiv icon

Leveraging Routing Dynamics in Mixture-of-Experts Models for Efficient Language Adaptation

Add code
May 28, 2026
Viaarxiv icon

Weasel: Out-of-Domain Generalization for Web Agents via Importance-Diversity Data Selection

Add code
May 19, 2026
Viaarxiv icon

Forecasting Downstream Performance of LLMs With Proxy Metrics

Add code
May 18, 2026
Viaarxiv icon

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

Add code
Apr 12, 2026
Viaarxiv icon

Structured Distillation of Web Agent Capabilities Enables Generalization

Add code
Apr 09, 2026
Viaarxiv icon

CUBE: A Standard for Unifying Agent Benchmarks

Add code
Mar 16, 2026
Viaarxiv icon

LLM2Vec-Gen: Generative Embeddings from Large Language Models

Add code
Mar 11, 2026
Viaarxiv icon

Operationalising the Superficial Alignment Hypothesis via Task Complexity

Add code
Feb 17, 2026
Viaarxiv icon