Picture for Ahmed Salem

Ahmed Salem

Microsoft Research

LogiPlan: A Structured Benchmark for Logical Planning and Relational Reasoning in LLMs

Add code
Jun 12, 2025
Viaarxiv icon

LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge

Add code
Jun 11, 2025
Viaarxiv icon

Securing AI Agents with Information-Flow Control

Add code
May 29, 2025
Viaarxiv icon

Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models

Add code
May 20, 2025
Viaarxiv icon

Jailbreaking is (Mostly) Simpler Than You Think

Add code
Mar 07, 2025
Viaarxiv icon

Obliviate: Efficient Unmemorization for Protecting Intellectual Property in Large Language Models

Add code
Feb 20, 2025
Viaarxiv icon

Permissive Information-Flow Analysis for Large Language Models

Add code
Oct 04, 2024
Viaarxiv icon

Vera Verto: Multimodal Hijacking Attack

Add code
Jul 31, 2024
Viaarxiv icon

Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification

Add code
Jul 30, 2024
Viaarxiv icon

Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique

Add code
Jul 15, 2024
Figure 1 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Figure 2 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Figure 3 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Figure 4 for Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
Viaarxiv icon