Ai, X., Yang, Q., Wang, P., Deng, L., Zhang, L., Chen, R., & Zhang, G. (2026). HyLRA: Hybrid Layer Reuse Attention for Efficient Long-Context Inference.
Chicago Style (17th ed.) CitationAi, Xuan, Qingqing Yang, Peng Wang, Lei Deng, Lin Zhang, Renhai Chen, and Gong Zhang. HyLRA: Hybrid Layer Reuse Attention for Efficient Long-Context Inference. 2026.
MLA (9th ed.) CitationAi, Xuan, et al. HyLRA: Hybrid Layer Reuse Attention for Efficient Long-Context Inference. 2026.
Warning: These citations may not always be 100% accurate.