Yano, K., Kiyono, S., Kobayashi, S., Takase, S., & Suzuki, J. (2026). Pre-training LLM without Learning Rate Decay Enhances Supervised Fine-Tuning.
Chicago Style (17th ed.) CitationYano, Kazuki, Shun Kiyono, Sosuke Kobayashi, Sho Takase, and Jun Suzuki. Pre-training LLM Without Learning Rate Decay Enhances Supervised Fine-Tuning. 2026.
MLA (9th ed.) CitationYano, Kazuki, et al. Pre-training LLM Without Learning Rate Decay Enhances Supervised Fine-Tuning. 2026.
Warning: These citations may not always be 100% accurate.