
Stine Lyngsø Beltoft

PsychoSafe: Eliciting Psychologically-Informed Refusals in Large Language Models

Jun 08, 2026
Via arXiv

Emergent Languages in Populations of Language Model Agents: From Token Efficiency to Oversight Evasion

May 29, 2026
Via arXiv