Picture for Chaowei Xiao

Chaowei Xiao

AutoDojo: Adaptive Attacks Expose Superficial Defenses and User-Underspecification Limits in LLM Agents

Add code
Jun 13, 2026
Viaarxiv icon

Runtime Skill Audit: Targeted Runtime Probing for Agent Skill Security

Add code
Jun 10, 2026
Viaarxiv icon

GeoDrive-Bench: Benchmarking Region-Specific Multimodal Reasoning in Autonomous Driving

Add code
Jun 01, 2026
Viaarxiv icon

MaskForge: Structure-Aware Adaptive Attacks for Jailbreaking Diffusion Large Language Models

Add code
Jun 01, 2026
Viaarxiv icon

SafeGen-Bench: Benchmarking Safety in Image-Conditioned Text-to-Video Generation

Add code
May 31, 2026
Viaarxiv icon

When Are Teacher Tokens Reliable? Position-Weighted On-Policy Self-Distillation for Reasoning

Add code
May 20, 2026
Viaarxiv icon

FORTIS: Benchmarking Over-Privilege in Agent Skills

Add code
May 09, 2026
Viaarxiv icon

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

Add code
May 06, 2026
Viaarxiv icon

ShieldNet: Network-Level Guardrails against Emerging Supply-Chain Injections in Agentic Systems

Add code
Apr 06, 2026
Viaarxiv icon

Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks

Add code
Mar 31, 2026
Viaarxiv icon