Wang, H., Wang, G., & Zhang, H. (2024). Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks.
Chicago Style (17th ed.) CitationWang, Han, Gang Wang, and Huan Zhang. Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks. 2024.
MLA (9th ed.) CitationWang, Han, et al. Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks. 2024.
Warning: These citations may not always be 100% accurate.