Picture for Oscar Gilg

Oscar Gilg

Probing Persona-Dependent Preferences in Language Models

Add code
May 13, 2026
Viaarxiv icon

Split Personality Training: Revealing Latent Knowledge Through Alternate Personalities

Add code
Feb 05, 2026
Viaarxiv icon