Goldstein, D., Alcaide, E., Lu, J., & Cheah, E. (2025). RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale.
Chicago Style (17th ed.) CitationGoldstein, Daniel, Eric Alcaide, Janna Lu, and Eugene Cheah. RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale. 2025.
MLA (9th ed.) CitationGoldstein, Daniel, et al. RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale. 2025.
Warning: These citations may not always be 100% accurate.