Picture for Xinhang Ma

Xinhang Ma

AutoDojo: Adaptive Attacks Expose Superficial Defenses and User-Underspecification Limits in LLM Agents

Add code
Jun 13, 2026
Viaarxiv icon

Protecting Language Models Against Unauthorized Distillation through Trace Rewriting

Add code
Feb 16, 2026
Viaarxiv icon

Conformal Reachability for Safe Control in Unknown Environments

Add code
Feb 03, 2026
Viaarxiv icon