Saved in:
| Main Authors: | Chen, Zixuan, Liu, Xuandong, Li, Minglin, Hu, Yinfan, Mei, Hao, Xing, Huifeng, Wang, Hao, Shi, Wanxin, Liu, Sen, Xu, Yang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.19721 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Short-circuiting Rings for Low-Latency AllReduce
by: Hammer, Sarah-Michelle, et al.
Published: (2025)
by: Hammer, Sarah-Michelle, et al.
Published: (2025)
AllReduce Scheduling with Hierarchical Deep Reinforcement Learning
by: Wei, Yufan, et al.
Published: (2025)
by: Wei, Yufan, et al.
Published: (2025)
OptiReduce: Resilient and Tail-Optimal AllReduce for Distributed Deep Learning in the Cloud
by: Warraich, Ertza, et al.
Published: (2023)
by: Warraich, Ertza, et al.
Published: (2023)
Trivance: Latency-Optimal AllReduce by Shortcutting Multiport Networks
by: Juerss, Anton, et al.
Published: (2026)
by: Juerss, Anton, et al.
Published: (2026)
Don't Let a Few Network Failures Slow the Entire AllReduce
by: Chen, Peiqing, et al.
Published: (2026)
by: Chen, Peiqing, et al.
Published: (2026)
RailS: Load Balancing for All-to-All Communication in Distributed Mixture-of-Experts Training
by: Xu, Heng, et al.
Published: (2025)
by: Xu, Heng, et al.
Published: (2025)
FAST: An Efficient Scheduler for All-to-All GPU Communication
by: Lei, Yiran, et al.
Published: (2025)
by: Lei, Yiran, et al.
Published: (2025)
Ethereal: Divide and Conquer Network Load Balancing in Large-Scale Distributed Training
by: Addanki, Vamsi, et al.
Published: (2024)
by: Addanki, Vamsi, et al.
Published: (2024)
Efficient All-to-All Collective Communication Schedules for Direct-Connect Topologies
by: Basu, Prithwish, et al.
Published: (2023)
by: Basu, Prithwish, et al.
Published: (2023)
Revisiting Bruck: Phase-Efficient All-to-All Communication in Reconfigurable Networks
by: Juerss, Anton, et al.
Published: (2026)
by: Juerss, Anton, et al.
Published: (2026)
Dynamic Hierarchical Birkhoff-von Neumann Decomposition for All-to-All GPU Communication
by: Wu, Yen-Chieh, et al.
Published: (2026)
by: Wu, Yen-Chieh, et al.
Published: (2026)
A Distributed Efficient Blockchain Oracle Scheme for Internet of Things
by: Xian, Youquan, et al.
Published: (2023)
by: Xian, Youquan, et al.
Published: (2023)
Graph neural network for in-network placement of real-time metaverse tasks in next-generation network
by: Rashid, Sulaiman Muhammad, et al.
Published: (2024)
by: Rashid, Sulaiman Muhammad, et al.
Published: (2024)
A Mathematical Theory of Hyper-simplex Fractal Network for Blockchain: Part I
by: Yang, Kaiwen, et al.
Published: (2024)
by: Yang, Kaiwen, et al.
Published: (2024)
AirChain: A Novel Blockchain Framework and Low-Cost Device for Democratized Air Quality Data Aggregation
by: Stankiewicz, Samuel
Published: (2024)
by: Stankiewicz, Samuel
Published: (2024)
EdgeTimer: Adaptive Multi-Timescale Scheduling in Mobile Edge Computing with Deep Reinforcement Learning
by: Hao, Yijun, et al.
Published: (2024)
by: Hao, Yijun, et al.
Published: (2024)
Priority based inter-twin communication in vehicular digital twin networks
by: Zia, Qasim, et al.
Published: (2024)
by: Zia, Qasim, et al.
Published: (2024)
Multi-stage Flow Scheduling for LLM Serving
by: Sun, Yijun, et al.
Published: (2026)
by: Sun, Yijun, et al.
Published: (2026)
Network Anomaly Detection in Distributed Edge Computing Infrastructure
by: Marfo, William, et al.
Published: (2025)
by: Marfo, William, et al.
Published: (2025)
Analysing Mechanisms for Virtual Channel Management in Low-Diameter networks
by: Cano, Alejandro, et al.
Published: (2023)
by: Cano, Alejandro, et al.
Published: (2023)
Design and Operation of Shared Machine Learning Clusters on Campus
by: Xu, Kaiqiang, et al.
Published: (2021)
by: Xu, Kaiqiang, et al.
Published: (2021)
A Uniqueness Theorem for Distributed Computation under Physical Constraint
by: Ren, Zhiyuan, et al.
Published: (2025)
by: Ren, Zhiyuan, et al.
Published: (2025)
Policy Design in Zero-Trust Distributed Networks: Challenges and Solutions
by: Sandjaja, Fannya R., et al.
Published: (2025)
by: Sandjaja, Fannya R., et al.
Published: (2025)
Joint wireless and computing resource management with optimal slice selection in in-network-edge metaverse system
by: Rashid, Sulaiman Muhammad, et al.
Published: (2024)
by: Rashid, Sulaiman Muhammad, et al.
Published: (2024)
Contention-Aware Microservice Deployment in Collaborative Mobile Edge Networks
by: Ge, Xinlei, et al.
Published: (2024)
by: Ge, Xinlei, et al.
Published: (2024)
DistriFS: A Platform and User Agnostic Approach to File Distribution
by: Boesch, Julian
Published: (2024)
by: Boesch, Julian
Published: (2024)
FODT: Fast, Online, Distributed and Temporary Failure Recovery Approach for MEC
by: Yuan, Xin, et al.
Published: (2023)
by: Yuan, Xin, et al.
Published: (2023)
Fast Prototyping of Distributed Stream Processing Applications with stream2gym
by: Ifath, Md. Monzurul Amin, et al.
Published: (2024)
by: Ifath, Md. Monzurul Amin, et al.
Published: (2024)
A Multi-Layered Distributed Computing Framework for Enhanced Edge Computing
by: Ma, Ke, et al.
Published: (2024)
by: Ma, Ke, et al.
Published: (2024)
FlowTracer: A Tool for Uncovering Network Path Usage Imbalance in AI Training Clusters
by: Jamil, Hasibul, et al.
Published: (2024)
by: Jamil, Hasibul, et al.
Published: (2024)
Edge-assisted Parallel Uncertain Skyline Processing for Low-latency IoE Analysis
by: Lai, Chuan-Chi, et al.
Published: (2025)
by: Lai, Chuan-Chi, et al.
Published: (2025)
OptiNIC: A Resilient and Tail-Optimal RDMA NIC for Distributed ML Workloads
by: Warraich, Ertza, et al.
Published: (2025)
by: Warraich, Ertza, et al.
Published: (2025)
Temporal-Aware GPU Resource Allocation for Distributed LLM Inference via Reinforcement Learning
by: Du, Chengze, et al.
Published: (2025)
by: Du, Chengze, et al.
Published: (2025)
Heterogeneity-aware P2P Wireless Energy Transfer for Balanced Energy Distribution
by: Ojha, Tamoghna, et al.
Published: (2022)
by: Ojha, Tamoghna, et al.
Published: (2022)
D-LoRa: a Distributed Parameter Adaptation Scheme for LoRa Network
by: Wang, Ruiqi, et al.
Published: (2025)
by: Wang, Ruiqi, et al.
Published: (2025)
SANSee: A Physical-layer Semantic-aware Networking Framework for Distributed Wireless Sensing
by: Zhu, Huixiang, et al.
Published: (2024)
by: Zhu, Huixiang, et al.
Published: (2024)
EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving
by: Liu, Boyi, et al.
Published: (2024)
by: Liu, Boyi, et al.
Published: (2024)
A Density-Delay Law for Stable Event-Driven State Progression in Open Distributed Systems
by: Chen, Bin, et al.
Published: (2026)
by: Chen, Bin, et al.
Published: (2026)
Distributed Simulation for Digital Twins of Large-Scale Real-World DiffServ-Based Networks
by: Huang, Zhuoyao, et al.
Published: (2024)
by: Huang, Zhuoyao, et al.
Published: (2024)
Towards Practical Overlay Networks for Decentralized Federated Learning
by: Hua, Yifan, et al.
Published: (2024)
by: Hua, Yifan, et al.
Published: (2024)
Similar Items
-
Short-circuiting Rings for Low-Latency AllReduce
by: Hammer, Sarah-Michelle, et al.
Published: (2025) -
AllReduce Scheduling with Hierarchical Deep Reinforcement Learning
by: Wei, Yufan, et al.
Published: (2025) -
OptiReduce: Resilient and Tail-Optimal AllReduce for Distributed Deep Learning in the Cloud
by: Warraich, Ertza, et al.
Published: (2023) -
Trivance: Latency-Optimal AllReduce by Shortcutting Multiport Networks
by: Juerss, Anton, et al.
Published: (2026) -
Don't Let a Few Network Failures Slow the Entire AllReduce
by: Chen, Peiqing, et al.
Published: (2026)