Alert button
Picture for Daniel Scalena

Daniel Scalena

Alert button

Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence

Add code
Bookmark button
Alert button
Sep 01, 2023
Daniel Scalena, Gabriele Sarti, Malvina Nissim, Elisabetta Fersini

Viaarxiv icon