Lee, T. (2025). State-Dependent Refusal and Learned Incapacity in RLHF-Aligned Language Models.
Chicago Style (17th ed.) Citation: Lee, TK. State-Dependent Refusal and Learned Incapacity in RLHF-Aligned Language Models. 2025.
MLA (9th ed.) Citation: Lee, TK. State-Dependent Refusal and Learned Incapacity in RLHF-Aligned Language Models. 2025.