Zhang, Z., Yang, S., & Kasneci, G. (2026). Consolidating Rewarded Perturbations for LLM Post-Training.
Chicago Style (17th ed.) CitationZhang, Zheyu, Shuo Yang, and Gjergji Kasneci. Consolidating Rewarded Perturbations for LLM Post-Training. 2026.
MLA (9th ed.) CitationZhang, Zheyu, et al. Consolidating Rewarded Perturbations for LLM Post-Training. 2026.
Warning: These citations may not always be 100% accurate.