APA (7th ed.) Citation

García-Carrasco, J., Maté, A., & Trujillo, J. (2024). Detecting and Understanding Vulnerabilities in Language Models via Mechanistic Interpretability.

Chicago Style (17th ed.) Citation

García-Carrasco, Jorge, Alejandro Maté, and Juan Trujillo. Detecting and Understanding Vulnerabilities in Language Models via Mechanistic Interpretability. 2024.

MLA (9th ed.) Citation

García-Carrasco, Jorge, et al. Detecting and Understanding Vulnerabilities in Language Models via Mechanistic Interpretability. 2024.

Warning: These citations may not always be 100% accurate.