Zhonghao Yang

Benchmarks for Trajectory Safety Evaluation and Diagnosis in OpenClaw and Codex: ATBench-Claw and ATBench-CodeX

Apr 16, 2026
Via arXiv

ATBench: A Diverse and Realistic Agent Trajectory Benchmark for Safety Evaluation and Diagnosis

Apr 08, 2026
Via arXiv

ATBench: A Diverse and Realistic Trajectory Benchmark for Long-Horizon Agent Safety

Apr 02, 2026
Via arXiv

AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security

Jan 26, 2026
Via arXiv

Toward Efficient Testing of Graph Neural Networks via Test Input Prioritization

Dec 20, 2025
Via arXiv

ArcGen: Generalizing Neural Backdoor Detection Across Diverse Architectures

Dec 17, 2025
Via arXiv

Harnessing Scalable Transactional Stream Processing for Managing Large Language Models [Vision]

Jul 17, 2023
Via arXiv