APA (7th ed.) Citation

Chaudhary, G., Behera, L., & Mondal, W. U. (2026). Match or Replay: Self Imitating Proximal Policy Optimization.

Chicago Style (17th ed.) Citation

Chaudhary, Gaurav, Laxmidhar Behera, and Washim Uddin Mondal. Match or Replay: Self Imitating Proximal Policy Optimization. 2026.

MLA (9th ed.) Citation

Chaudhary, Gaurav, et al. Match or Replay: Self Imitating Proximal Policy Optimization. 2026.

Warning: These citations may not always be 100% accurate.