Southworth, B. S., & Thomas, S. (2026). Beyond Muon: MUD (MomentUm Decorrelation) for Faster Transformer Training.
Chicago Style (17th ed.) CitationSouthworth, Ben S., and Stephen Thomas. Beyond Muon: MUD (MomentUm Decorrelation) for Faster Transformer Training. 2026.
MLA (9th ed.) CitationSouthworth, Ben S., and Stephen Thomas. Beyond Muon: MUD (MomentUm Decorrelation) for Faster Transformer Training. 2026.
Warning: These citations may not always be 100% accurate.