Mahmoud Ghanem

Measuring AI Agents' Progress on Multi-Step Cyber Attack Scenarios

Mar 11, 2026
Via arXiv

Improving Methodologies for LLM Evaluations Across Global Languages

Jan 22, 2026
Via arXiv