Berg, C., & Lulla, R. (2026). Exploitation Without Deception: Dark Triad Feature Steering Reveals Separable Antisocial Circuits in Language Models.
Chicago Style (17th ed.) CitationBerg, Cameron, and Roshni Lulla. Exploitation Without Deception: Dark Triad Feature Steering Reveals Separable Antisocial Circuits in Language Models. 2026.
MLA (9th ed.) CitationBerg, Cameron, and Roshni Lulla. Exploitation Without Deception: Dark Triad Feature Steering Reveals Separable Antisocial Circuits in Language Models. 2026.
Warning: These citations may not always be 100% accurate.