Fan, M., Han, W., Wang, D., Chen, C., Zhang, Z., & Zhou, J. (2026). When Sharpening Becomes Collapse: Sampling Bias and Semantic Coupling in RL with Verifiable Rewards.
Chicago style citation (17th edition): Fan, Mingyuan, Weiguang Han, Daixin Wang, Cen Chen, Zhiqiang Zhang, and Jun Zhou. When Sharpening Becomes Collapse: Sampling Bias and Semantic Coupling in RL with Verifiable Rewards. 2026.
MLA citation (9th ed.): Fan, Mingyuan, et al. When Sharpening Becomes Collapse: Sampling Bias and Semantic Coupling in RL with Verifiable Rewards. 2026.
Warning: These citations may not be 100% accurate.