Gai, J., Zhang, S., Song, X., Wang, B., & Karypis, G. (2026). DualKV: Shared-Prompt Flash Attention for Efficient RL Training with Large Rollouts and Long Contexts.
Chicago Style (17th ed.) Citation: Gai, Jiading, Shuai Zhang, Xiang Song, Bernie Wang, and George Karypis. DualKV: Shared-Prompt Flash Attention for Efficient RL Training with Large Rollouts and Long Contexts. 2026.
MLA (9th ed.) Citation: Gai, Jiading, et al. DualKV: Shared-Prompt Flash Attention for Efficient RL Training with Large Rollouts and Long Contexts. 2026.