Ding, Y., Zhang, C., Li, J., Lin, H., & Zhang, M. (2025). FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning.
Style de citation Chicago (17e éd.)Ding, Yuyang, Chi Zhang, Juntao Li, Haibin Lin, et Min Zhang. FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning. 2025.
Style de citation MLA (9e éd.)Ding, Yuyang, et al. FAPO: Flawed-Aware Policy Optimization for Efficient and Reliable Reasoning. 2025.
Attention : ces citations peuvent ne pas être correctes à 100%.