Skip to content
VuFind
  • Login
    • English
    • Deutsch
    • Español
    • Français
    • Italiano
    • 日本語
    • Nederlands
    • Português
    • Português (Brasil)
    • 中文(简体)
    • 中文(繁體)
    • Türkçe
    • עברית
    • Gaeilge
    • Cymraeg
    • Ελληνικά
    • Català
    • Euskara
    • Русский
    • Čeština
    • Suomi
    • Svenska
    • polski
    • Dansk
    • slovenščina
    • اللغة العربية
    • বাংলা
    • Galego
    • Tiếng Việt
    • Hrvatski
    • हिंदी
    • Հայերէն
    • Українська
    • Sámegiella
    • Монгол
    • Māori
Advanced
  • Cite this
  • Text this
  • Email this
  • Print
  • Export Record
    • Export to RefWorks
    • Export to EndNoteWeb
    • Export to EndNote
  • Save to List
  • Permanent link
Cover Image

Saved in:
Bibliographic Details
Main Authors: Yan, Zijie, Bai, Hongxiao, Yao, Xin, Liu, Dennis, Liu, Tong, Liu, Hongbin, Li, Pingtian, Wu, Evan, Fan, Shiqing, Tao, Li, Zhang, Robin, Wang, Yuzhong, Xu, Shifang, Chang, Jack, Chen, Xuwen, Li, Kunlun, Bai, Yan, Deng, Gao, Zheng, Nan, Korthikanti, Vijay Anand, Khattar, Abhinav, He, Ethan, Govande, Soham, Lym, Sangkug, Zhu, Zhongbo, Zhang, Qi, Yuan, Haochen, Ren, Xiaowei, Fu, Deyu, Ma, Tailai, Zhang, Shunkang, Shao, Jiang, Wang, Ray, Rengasamy, Vasudevan, Garg, Rachit, Bhavani, Santosh, Li, Xipeng, Zhou, Chandler, Wu, David, Wei, Yingcan, Aithal, Ashwath, Andersch, Michael, Shoeybi, Mohammad, Yao, Jiajie, Yang, June
Format: Preprint
Published: 2026
Subjects:
Distributed, Parallel, and Cluster Computing
Computation and Language
Machine Learning
Online Access:https://arxiv.org/abs/2603.07685
Tags: Add Tag
No Tags, Be the first to tag this record!
  • Holdings
  • Description
  • Table of Contents
  • Comments
  • Similar Items
  • Staff View

Internet

https://arxiv.org/abs/2603.07685

Similar Items

  • MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core
    by: Liu, Dennis, et al.
    Published: (2025)
  • Upcycling Large Language Models into Mixture of Experts
    by: He, Ethan, et al.
    Published: (2024)
  • DNA Cleavage System by Nanomaterials
    by: Jinci Li, et al.
    Published: (2024)
  • MegatronApp: Efficient and Comprehensive Management on Distributed LLM Training
    by: Zhao, Bohan, et al.
    Published: (2025)
  • Heterogeneous Parallelism for Multimodal Large Language Model Training
    by: Karnati, Yashaswi, et al.
    Published: (2026)

Search Options

  • Search History
  • Advanced Search

Find More

  • Browse the Catalog
  • Browse Alphabetically
  • Explore Channels
  • Course Reserves
  • New Items

Need Help?

  • Search Tips
  • Ask a Librarian
  • FAQs