Kawasaki, A., Davis, A., & Abbas, H. (2024). Defending Large Language Models Against Attacks With Residual Stream Activation Analysis.
Chicago Style (17th ed.) Citation: Kawasaki, Amelia, Andrew Davis, and Houssam Abbas. Defending Large Language Models Against Attacks With Residual Stream Activation Analysis. 2024.
MLA (9th ed.) Citation: Kawasaki, Amelia, et al. Defending Large Language Models Against Attacks With Residual Stream Activation Analysis. 2024.