Mrinal Agarwal

Zero-Shot Embedding Drift Detection: A Lightweight Defense Against Prompt Injections in LLMs

Jan 18, 2026
Via arXiv

WOLF: Werewolf-based Observations for LLM Deception and Falsehoods

Dec 09, 2025
Via arXiv