Lu, D., & Rimsky, N. (2024). Investigating Bias Representations in Llama 2 Chat via Activation Steering.
Chicago Style (17th ed.) CitationLu, Dawn, and Nina Rimsky. Investigating Bias Representations in Llama 2 Chat via Activation Steering. 2024.
MLA (9th ed.) CitationLu, Dawn, and Nina Rimsky. Investigating Bias Representations in Llama 2 Chat via Activation Steering. 2024.
Warning: These citations may not always be 100% accurate.