Picture for Govind Ramesh

Govind Ramesh

Localizing Prompt Ambiguity in Large Language Models with Probe-Targeted Attribution

Add code
Jun 03, 2026
Viaarxiv icon

GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation

Add code
May 21, 2024
Viaarxiv icon