Song, R., Song, Z., Guo, H., & Qiang, W. (2025). Causal Reward Adjustment: Mitigating Reward Hacking in External Reasoning via Backdoor Correction.
Chicago Style (17th edition) citation: Song, Ruike, Zeen Song, Huijie Guo, and Wenwen Qiang. Causal Reward Adjustment: Mitigating Reward Hacking in External Reasoning via Backdoor Correction. 2025.
MLA (9th ed.) citation: Song, Ruike, et al. Causal Reward Adjustment: Mitigating Reward Hacking in External Reasoning via Backdoor Correction. 2025.
Warning: These citations may not be 100% accurate.