Picture for Roshni Lulla

Roshni Lulla

Exploitation Without Deception: Dark Triad Feature Steering Reveals Separable Antisocial Circuits in Language Models

Add code
May 10, 2026
Viaarxiv icon

The Subject of Emergent Misalignment in Superintelligence: An Anthropological, Cognitive Neuropsychological, Machine-Learning, and Ontological Perspective

Add code
Dec 19, 2025
Viaarxiv icon