Lin, Y., Jin, D., Xu, T., Wu, T., Sukhbaatar, S., Zhu, C., . . . Fang, H. (2025). Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback.
Chicago citation style (17th ed.): Lin, Yen-Ting, et al. Step-KTO: Optimizing Mathematical Reasoning Through Stepwise Binary Feedback. 2025.
MLA citation style (9th ed.): Lin, Yen-Ting, et al. Step-KTO: Optimizing Mathematical Reasoning Through Stepwise Binary Feedback. 2025.
Warning: these citations may not be 100% accurate.