Le, T., Van, L. N., & Le, T. (2025). Sharpness-Guided Group Relative Policy Optimization via Probability Shaping.
Chicago Style (17th ed.) Citation: Le, Tue, Linh Ngo Van, and Trung Le. Sharpness-Guided Group Relative Policy Optimization via Probability Shaping. 2025.
MLA (9th ed.) Citation: Le, Tue, et al. Sharpness-Guided Group Relative Policy Optimization via Probability Shaping. 2025.