Picture for Shravan Doda

Shravan Doda

Before the Last Token: Diagnosing Final-Token Safety Probe Failures

Add code
May 12, 2026
Viaarxiv icon

VERBA: Verbalizing Model Differences Using Large Language Models

Add code
Jul 03, 2025
Viaarxiv icon