Hughes, A., Goldberg, A., Jha, P., Perer, A., Aletras, N., & Mireshghallah, N. (2026). Boundary-targeted Membership Inference Attacks on Safety Classifiers.
Chicago Style (17th ed.) Citation: Hughes, Anthony, Alexander Goldberg, Prince Jha, Adam Perer, Nikolaos Aletras, and Niloofar Mireshghallah. Boundary-targeted Membership Inference Attacks on Safety Classifiers. 2026.
MLA (9th ed.) Citation: Hughes, Anthony, et al. Boundary-targeted Membership Inference Attacks on Safety Classifiers. 2026.