Picture for Pulkit Handa

Pulkit Handa

Design Principles for the Construction of a Benchmark Evaluating Security Operation Capabilities of Multi-agent AI Systems

Add code
Mar 30, 2026
Viaarxiv icon