Gu, H., Wang, H., Liu, J., Li, L., Zhu, Q., Liu, B., . . . Guo, Y. (2026). QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training under Training--Inference Mismatch.
Chicago Style (17th ed.) CitationGu, Hao, et al. QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training Under Training--Inference Mismatch. 2026.
MLA (9th ed.) CitationGu, Hao, et al. QaRL: Rollout-Aligned Quantization-Aware RL for Fast and Stable Training Under Training--Inference Mismatch. 2026.
Warning: These citations may not always be 100% accurate.