| Main Authors: | Liang, Zhiming, Chen, Bin, Ye, Litao, Sun, Chen, Wang, Shuo, Peng, Zhe |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.09942 |
Similar Items
Blockchain-Enabled Dynamic Spectrum Sharing for Satellite and Terrestrial Communication Networks
by: Wang, Zixin, et al.
Published: (2024)
Managing Federated Learning on Decentralized Infrastructures as a Reputation-based Collaborative Workflow
by: Wang, Yuandou, et al.
Published: (2025)
FIRED: a fine-grained robust performance diagnosis framework for cloud applications
by: Xin, Ruyue, et al.
Published: (2022)
Stable-MoE: Lyapunov-based Token Routing for Distributed Mixture-of-Experts Training over Edge Networks
by: Shi, Long, et al.
Published: (2025)
MemAscend: System Memory Optimization for SSD-Offloaded LLM Fine-Tuning
by: Liaw, Yong-Cheng, et al.
Published: (2025)
Analysis and Optimized CXL-Attached Memory Allocation for Long-Context LLM Fine-Tuning
by: Liaw, Yong-Cheng, et al.
Published: (2025)
D-VRE: From a Jupyter-enabled Private Research Environment to Decentralized Collaborative Research Ecosystem
by: Wang, Yuandou, et al.
Published: (2024)
CONCUR: High-Throughput Agentic Batch Inference of LLM via Congestion-Based Concurrency Control
by: Chen, Qiaoling, et al.
Published: (2026)
Gensor: A Graph-based Construction Tensor Compilation Method for Deep Learning
by: Liu, Hangda, et al.
Published: (2025)
Towards Seamless Serverless Computing Across an Edge-Cloud Continuum
by: Simion, Emilian, et al.
Published: (2024)
Unlocking Dynamic Inter-Client Spatial Dependencies: A Federated Spatio-Temporal Graph Learning Method for Traffic Flow Forecasting
by: Wang, Feng, et al.
Published: (2025)
Design of quasi phase matching crystal based on differential gray wolf algorithm
by: Chen, He, et al.
Published: (2025)
Adaptive multi-criteria-based load balancing technique for resource allocation in fog-cloud environments
by: Gad-Elrab, Ahmed A. A., et al.
Published: (2024)
SwitchDelta: Asynchronous Metadata Updating for Distributed Storage with In-Network Data Visibility
by: Li, Junru, et al.
Published: (2025)
Towards Privacy-, Budget-, and Deadline-Aware Service Optimization for Large Medical Image Processing across Hybrid Clouds
by: Wang, Yuandou, et al.
Published: (2024)
A Survey of Synchronization Technologies for Low-power Backscatter Communication
by: Jiang, Wenyuan, et al.
Published: (2025)
A Review on Edge Large Language Models: Design, Execution, and Applications
by: Zheng, Yue, et al.
Published: (2024)
Stencil Matrixization
by: Zhao, Wenxuan, et al.
Published: (2023)
A large-scale distributed parallel discrete event simulation engines based on Warped2 for Wargaming simulation
by: Jia, Xiaoning, et al.
Published: (2025)
Zeppelin: Balancing Variable-length Workloads in Data Parallel Large Model Training
by: Chen, Chang, et al.
Published: (2025)
Mining Area Skyline Objects from Map-based Big Data using Apache Spark Framework
by: Li, Chen, et al.
Published: (2024)
SPPO:Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading
by: Chen, Qiaoling, et al.
Published: (2025)
SeaLLM: Service-Aware and Latency-Optimized Resource Sharing for Large Language Model Inference
by: Zhao, Yihao, et al.
Published: (2025)
Communication-Efficient Collaborative LLM Inference over LEO Satellite Networks
by: Zhang, Songge, et al.
Published: (2026)
Ticket-based multi-strand method for increased efficiency in proof-of-work based blockchains
by: Rudberg, Elias
Published: (2024)
Flotilla: A scalable, modular and resilient federated learning framework for heterogeneous resources
by: Banerjee, Roopkatha, et al.
Published: (2025)
Prediction-driven resource provisioning for serverless container runtimes
by: Tomaras, Dimitrios, et al.
Published: (2024)
GENSERVE: Efficient Co-Serving of Heterogeneous Diffusion Model Workloads
by: Ye, Fanjiang, et al.
Published: (2026)
Byzantine Fault-Tolerant Min-Max Optimization
by: Liu, Shuo, et al.
Published: (2022)
AMSP: Reducing Communication Overhead of ZeRO for Efficient LLM Training
by: Chen, Qiaoling, et al.
Published: (2023)
Efficient Training of Large Language Models on Distributed Infrastructures: A Survey
by: Duan, Jiangfei, et al.
Published: (2024)
Three Birds, One Stone: Solving the Communication-Memory-Privacy Trilemma in LLM Fine-tuning Over Wireless Networks with Zeroth-Order Optimization
by: Cai, Zhijie, et al.
Published: (2026)
Simplifying HPC resource selection: A tool for optimizing execution time and cost on Azure
by: Netto, Marco A. S., et al.
Published: (2024)
Neutron particle transport 3D method of characteristic Multi GPU platform Parallel Computing
by: Zhou, Faguo, et al.
Published: (2025)
Edge-Cloud Collaborative Pothole Detection via Onboard Event Screening and Federated Temporal Segmentation
by: Wu, Yingjie, et al.
Published: (2026)
RingAda: Pipelining Large Model Fine-Tuning on Edge Devices with Scheduled Layer Unfreezing
by: Li, Liang, et al.
Published: (2025)
InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding
by: Chen, Qiaoling, et al.
Published: (2024)
Hexa-MoE: Efficient and Heterogeneous-aware Training for Mixture-of-Experts
by: Luo, Shuqing, et al.
Published: (2024)
MegatronApp: Efficient and Comprehensive Management on Distributed LLM Training
by: Zhao, Bohan, et al.
Published: (2025)
FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework
by: Mei, Junyi, et al.
Published: (2024)