Bai, B., Wang, X., Ye, P., & Chen, T. (2026). Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards.
Chicago Style (17th ed.) CitationBai, Bizhe, Xinyue Wang, Peng Ye, and Tao Chen. Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards. 2026.
MLA (9th ed.) CitationBai, Bizhe, et al. Learning to Explore with Parameter-Space Noise: A Deep Dive into Parameter-Space Noise for Reinforcement Learning with Verifiable Rewards. 2026.
Warning: These citations may not always be 100% accurate.