Cai, Z., & Hou, H. (2025). EG-MLA: Embedding-Gated Multi-head Latent Attention for Scalable and Efficient LLMs.
Cita Chicago Style (17a ed.)Cai, Zhengge, y Haowen Hou. EG-MLA: Embedding-Gated Multi-head Latent Attention for Scalable and Efficient LLMs. 2025.
Cita MLA (9a ed.)Cai, Zhengge, y Haowen Hou. EG-MLA: Embedding-Gated Multi-head Latent Attention for Scalable and Efficient LLMs. 2025.
Precaución: Estas citas no son 100% exactas.