Jenny Bao

Segment-Level Coherence for Robust Harmful Intent Probing in LLMs

Apr 16, 2026
Via arXiv