Sahar Abdelnabi

Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks

Feb 25, 2026
Via arXiv

Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems

Feb 16, 2026
Via arXiv

Stateless Yet Not Forgetful: Implicit Memory as a Hidden Channel in LLMs

Feb 09, 2026
Via arXiv

ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations

Nov 07, 2025
Via arXiv

Agent Skills Enable a New Class of Realistic and Trivially Simple Prompt Injections

Oct 30, 2025
Via arXiv

Terrarium: Revisiting the Blackboard for Multi-Agent Safety, Privacy, and Security Studies

Oct 16, 2025
Via arXiv

LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge

Jun 11, 2025
Via arXiv

Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models

May 20, 2025
Via arXiv

Taxonomy, Opportunities, and Challenges of Representation Engineering for Large Language Models

Feb 27, 2025
Via arXiv

Safety is Essential for Responsible Open-Ended Systems

Feb 06, 2025
Via arXiv