Wang, H., Ma, C., Reid, I., & Yaqub, M. (2025). Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning.
Chicago Style (17th ed.) CitationWang, Hu, Congbo Ma, Ian Reid, and Mohammad Yaqub. Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning. 2025.
MLA (9th ed.) CitationWang, Hu, et al. Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning. 2025.
Warning: These citations may not always be 100% accurate.