Gao, H., Zhang, Z., Pang, L., Guo, F., Dou, H., Lv, G., . . . Cheng, X. (2026). DIVA-GRPO: Enhancing Multimodal Reasoning through Difficulty-Adaptive Variant Advantage.
Style de citation Chicago (17e éd.)Gao, Haowen, et al. DIVA-GRPO: Enhancing Multimodal Reasoning Through Difficulty-Adaptive Variant Advantage. 2026.
Style de citation MLA (9e éd.)Gao, Haowen, et al. DIVA-GRPO: Enhancing Multimodal Reasoning Through Difficulty-Adaptive Variant Advantage. 2026.
Attention : ces citations peuvent ne pas être correctes à 100%.