Alagharu, R., Singh, I. S., Shamsudeen, S., Wu, Z., & Panda, A. (2026). From Refusal Tokens to Refusal Control: Discovering and Steering Category-Specific Refusal Directions.
Chicago Style (17th ed.) Citation: Alagharu, Rishab, Ishneet Sukhvinder Singh, Shaibi Shamsudeen, Zhen Wu, and Ashwinee Panda. From Refusal Tokens to Refusal Control: Discovering and Steering Category-Specific Refusal Directions. 2026.
MLA (9th ed.) Citation: Alagharu, Rishab, et al. From Refusal Tokens to Refusal Control: Discovering and Steering Category-Specific Refusal Directions. 2026.