Picture for Dawn Song

Dawn Song

University of California, Berkeley

Revelio: Cost-Efficient Agentic Memory Safety Vulnerability Detection For Repository-Scale Codebases

Add code
Jun 20, 2026
Viaarxiv icon

ChainWorld: Composing Long-Horizon Desktop Workloads from Atomic OSWorld Tasks

Add code
Jun 19, 2026
Viaarxiv icon

When Do Intrinsic Rewards Work for Code Reasoning? A Comprehensive Study

Add code
Jun 18, 2026
Viaarxiv icon

VIMPO: Value-Implicit Policy Optimization for LLMs

Add code
Jun 18, 2026
Viaarxiv icon

Same-Origin Policy for Agentic Browsers

Add code
Jun 12, 2026
Viaarxiv icon

AgentBeats: Agentifying Agent Assessment for Openness, Standardization, and Reproducibility

Add code
Jun 11, 2026
Viaarxiv icon

Representational Similarity and Model Behavior in Multi-Agent Interaction

Add code
Jun 05, 2026
Viaarxiv icon

CyberGym-E2E: Scalable Real-World Benchmark for AI Agents' End-to-End Cybersecurity Capabilities

Add code
Jun 03, 2026
Viaarxiv icon

Agents' Last Exam

Add code
Jun 03, 2026
Viaarxiv icon

Can Generalist Agents Automate Data Curation?

Add code
Jun 02, 2026
Viaarxiv icon