APA (7th ed.) Citation

Maity, D. (2026). SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for Reinforcement Learning from Human Feedback (RLHF).

Chicago Style (17th ed.) Citation

Maity, Dipan. SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for Reinforcement Learning from Human Feedback (RLHF). 2026.

MLA (9th ed.) Citation

Maity, Dipan. SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for Reinforcement Learning from Human Feedback (RLHF). 2026.

Warning: These citations may not always be 100% accurate.