APA Citation: Braun, J., Eickhoff, C., Krueger, D., Bahrainian, S. A., & Krasheninnikov, D. (2025). Understanding (Un)Reliability of Steering Vectors in Language Models.
Chicago Style (17th ed.) Citation: Braun, Joschka, Carsten Eickhoff, David Krueger, Seyed Ali Bahrainian, and Dmitrii Krasheninnikov. Understanding (Un)Reliability of Steering Vectors in Language Models. 2025.
MLA (9th ed.) Citation: Braun, Joschka, et al. Understanding (Un)Reliability of Steering Vectors in Language Models. 2025.
Warning: These citations may not always be 100% accurate.