Luo, Y., Zhou, Z., Wang, M., & Dong, B. (2024). Jailbreak Instruction-Tuned LLMs via end-of-sentence MLP Re-weighting.
Chicago Style (17th ed.) CitationLuo, Yifan, Zhennan Zhou, Meitan Wang, and Bin Dong. Jailbreak Instruction-Tuned LLMs via End-of-sentence MLP Re-weighting. 2024.
MLA (9th ed.) CitationLuo, Yifan, et al. Jailbreak Instruction-Tuned LLMs via End-of-sentence MLP Re-weighting. 2024.
Warning: These citations may not always be 100% accurate.