Adversarial Fine-Tuning of Language Models: An Iterative Optimisation Approach for the Generation and Detection of Problematic Content

Aug 26, 2023
Charles O'Neill, Jack Miller, Ioana Ciuca, Yuan-Sen Ting, Thang Bui

View paper on arXiv
