APA (7th ed.) Citation

Shu, D., Zhang, D., & Hullman, J. (2026). Learning from the Right Rollouts: Data Attribution for PPO-based LLM Post-Training.

Chicago Style (17th ed.) Citation

Shu, Dong, Denghui Zhang, and Jessica Hullman. Learning from the Right Rollouts: Data Attribution for PPO-based LLM Post-Training. 2026.

MLA (9th ed.) Citation

Shu, Dong, et al. Learning from the Right Rollouts: Data Attribution for PPO-based LLM Post-Training. 2026.

Warning: These citations may not always be 100% accurate.