Zeng, Y., Li, G., Miao, Y., Li, X., Wang, Y., Chen, S., . . . Yuan, B. (2026). EAPO: Entropy-Driven Adaptive Positive-Negative Sample Weighting for Policy Optimization in Open-Ended QA.
Chicago Style (17th ed.) Citation: Zeng, Yunsheng, et al. EAPO: Entropy-Driven Adaptive Positive-Negative Sample Weighting for Policy Optimization in Open-Ended QA. 2026.
MLA (9th ed.) Citation: Zeng, Yunsheng, et al. EAPO: Entropy-Driven Adaptive Positive-Negative Sample Weighting for Policy Optimization in Open-Ended QA. 2026.