Gong, X., Yang, S., Cao, Z., Billard, L., & Wang, D. (2026). Faithful-Patchscopes: Understanding and Mitigating Model Bias in Hidden Representations Explanation of Large Language Models.
Chicago Style (17th ed.) CitationGong, Xilin, Shu Yang, Zehua Cao, Lynne Billard, and Di Wang. Faithful-Patchscopes: Understanding and Mitigating Model Bias in Hidden Representations Explanation of Large Language Models. 2026.
MLA (9th ed.) CitationGong, Xilin, et al. Faithful-Patchscopes: Understanding and Mitigating Model Bias in Hidden Representations Explanation of Large Language Models. 2026.
Warning: These citations may not always be 100% accurate.