Bibliographic Details
Main Author: Lyu, Yi
Format: Preprint
Published: 2026
Subjects: Distributed, Parallel, and Cluster Computing; Artificial Intelligence
Online Access: https://arxiv.org/abs/2604.01607
author Lyu, Yi
author_facet Lyu, Yi
contents Large-scale distributed training has been an active research topic in machine learning systems, in both industry and academia, in recent years. However, running experiments is difficult without physical machines and the corresponding resources. One solution is to use distributed training simulators, but existing ones such as ASTRA-sim do not support importing models developed in practice, which makes them hard for ML researchers to use. To address this challenge, we developed ModTrans, a translator that converts any real-world model into the input format of the ASTRA-sim simulator, removing the barrier between machine learning experts and machine learning system researchers. Experimental results show that ModTrans's cost is negligible.
format Preprint
id arxiv_https___arxiv_org_abs_2604_01607
institution arXiv
publishDate 2026
record_format arxiv
title ModTrans: Translating Real-world Models for Distributed Training Simulator
topic Distributed, Parallel, and Cluster Computing
Artificial Intelligence
url https://arxiv.org/abs/2604.01607