Picture for Dawn Song

Dawn Song

University of California, Berkeley

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

Add code
May 06, 2026
Viaarxiv icon

The Long-Horizon Task Mirage? Diagnosing Where and Why Agentic Systems Break

Add code
Apr 13, 2026
Viaarxiv icon

Intent-aligned Formal Specification Synthesis via Traceable Refinement

Add code
Apr 12, 2026
Viaarxiv icon

SecPI: Secure Code Generation with Reasoning Models via Security Reasoning Internalization

Add code
Apr 04, 2026
Viaarxiv icon

A Framework for Formalizing LLM Agent Security

Add code
Mar 19, 2026
Viaarxiv icon

CUBE: A Standard for Unifying Agent Benchmarks

Add code
Mar 16, 2026
Viaarxiv icon

The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey

Add code
Mar 11, 2026
Viaarxiv icon

dLLM: Simple Diffusion Language Modeling

Add code
Feb 26, 2026
Viaarxiv icon

Strategy Executability in Mathematical Reasoning: Leveraging Human-Model Differences for Effective Guidance

Add code
Feb 26, 2026
Viaarxiv icon

IMMACULATE: A Practical LLM Auditing Framework via Verifiable Computation

Add code
Feb 26, 2026
Viaarxiv icon