Feng, Y., Wu, Z., Wu, Z., Gu, J., & Yu, J. (2026). M$^{2}$GRPO: Mamba-based Multi-Agent Group Relative Policy Optimization for Biomimetic Underwater Robots Pursuit.
Chicago Style (17th ed.) CitationFeng, Yukai, Zhiheng Wu, Zhengxing Wu, Junwen Gu, and Junzhi Yu. M$^{2}$GRPO: Mamba-based Multi-Agent Group Relative Policy Optimization for Biomimetic Underwater Robots Pursuit. 2026.
MLA (9th ed.) CitationFeng, Yukai, et al. M$^{2}$GRPO: Mamba-based Multi-Agent Group Relative Policy Optimization for Biomimetic Underwater Robots Pursuit. 2026.
Warning: These citations may not always be 100% accurate.