Guardado en:
| Autores principales: | Sultana, Abeda, Pakka, Nabin, Xu, Fei, Yuan, Xu, Chen, Li, Tzeng, Nian-Feng |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2503.10918 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Prima.cpp: Fast 30-70B LLM Inference on Heterogeneous and Low-Resource Home Clusters
por: Li, Zonghang, et al.
Publicado: (2025)
por: Li, Zonghang, et al.
Publicado: (2025)
FedPLT: Scalable, Resource-Efficient, and Heterogeneity-Aware Federated Learning via Partial Layer Training
por: Dabaja, Ahmad, et al.
Publicado: (2026)
por: Dabaja, Ahmad, et al.
Publicado: (2026)
Aergia: Leveraging Heterogeneity in Federated Learning Systems
por: Cox, Bart, et al.
Publicado: (2022)
por: Cox, Bart, et al.
Publicado: (2022)
Towards Optimal Heterogeneous Client Sampling in Multi-Model Federated Learning
por: Zhang, Haoran, et al.
Publicado: (2025)
por: Zhang, Haoran, et al.
Publicado: (2025)
Connecting Large Language Model Agent to High Performance Computing Resource
por: Ma, Heng, et al.
Publicado: (2025)
por: Ma, Heng, et al.
Publicado: (2025)
AutoDDL: Automatic Distributed Deep Learning with Near-Optimal Bandwidth Cost
por: Chen, Jinfan, et al.
Publicado: (2023)
por: Chen, Jinfan, et al.
Publicado: (2023)
How Machine Learning-Data Driven Replication Strategies Enhance Fault Tolerance in Large-Scale Distributed Systems
por: Murimi, Almond Kiruthu
Publicado: (2025)
por: Murimi, Almond Kiruthu
Publicado: (2025)
AMP4EC: Adaptive Model Partitioning Framework for Efficient Deep Learning Inference in Edge Computing Environments
por: Zhang, Guilin, et al.
Publicado: (2025)
por: Zhang, Guilin, et al.
Publicado: (2025)
SparkAttention: High-Performance Multi-Head Attention for Large Models on Volta GPU Architecture
por: Xu, Youxuan, et al.
Publicado: (2025)
por: Xu, Youxuan, et al.
Publicado: (2025)
Accelerating Geo-distributed Machine Learning with Network-Aware Adaptive Tree and Auxiliary Route
por: Li, Zonghang, et al.
Publicado: (2024)
por: Li, Zonghang, et al.
Publicado: (2024)
SI-ChainFL: Shapley-Incentivized Secure Federated Learning for High-Speed Rail Data Sharing
por: Zhao, Mingjie, et al.
Publicado: (2026)
por: Zhao, Mingjie, et al.
Publicado: (2026)
Roadmap for Edge AI: A Dagstuhl Perspective
por: Ding, Aaron Yi, et al.
Publicado: (2021)
por: Ding, Aaron Yi, et al.
Publicado: (2021)
Parameterizing Federated Continual Learning for Reproducible Research
por: Cox, Bart, et al.
Publicado: (2024)
por: Cox, Bart, et al.
Publicado: (2024)
Asynchronous Byzantine Federated Learning
por: Cox, Bart, et al.
Publicado: (2024)
por: Cox, Bart, et al.
Publicado: (2024)
Hyper-parameter Optimization for Federated Learning with Step-wise Adaptive Mechanism
por: Saadati, Yasaman, et al.
Publicado: (2024)
por: Saadati, Yasaman, et al.
Publicado: (2024)
Training Diffusion Models with Federated Learning
por: de Goede, Matthijs, et al.
Publicado: (2024)
por: de Goede, Matthijs, et al.
Publicado: (2024)
Quantize Once, Train Fast: Allreduce-Compatible Compression with Provable Guarantees
por: Xin, Jihao, et al.
Publicado: (2023)
por: Xin, Jihao, et al.
Publicado: (2023)
Comparison of Autoscaling Frameworks for Containerised Machine-Learning-Applications in a Local and Cloud Environment
por: Schroeder, Christian, et al.
Publicado: (2023)
por: Schroeder, Christian, et al.
Publicado: (2023)
Asynchronous Multi-Server Federated Learning for Geo-Distributed Clients
por: Zuo, Yuncong, et al.
Publicado: (2024)
por: Zuo, Yuncong, et al.
Publicado: (2024)
ADF-LoRA: Alternating Low-Rank Aggregation for Decentralized Federated Fine-Tuning
por: Wang, Xiaoyu, et al.
Publicado: (2025)
por: Wang, Xiaoyu, et al.
Publicado: (2025)
DAGER: Exact Gradient Inversion for Large Language Models
por: Petrov, Ivo, et al.
Publicado: (2024)
por: Petrov, Ivo, et al.
Publicado: (2024)
CodeCRDT: Observation-Driven Coordination for Multi-Agent LLM Code Generation
por: Pugachev, Sergey
Publicado: (2025)
por: Pugachev, Sergey
Publicado: (2025)
Separating Intelligence from Execution: A Workflow Engine for the Model Context Protocol
por: Parmar, Abhinav Singh
Publicado: (2026)
por: Parmar, Abhinav Singh
Publicado: (2026)
Libra: Unleashing GPU Heterogeneity for High-Performance Sparse Matrix Multiplication
por: Shi, Jinliang, et al.
Publicado: (2025)
por: Shi, Jinliang, et al.
Publicado: (2025)
A Full Compression Pipeline for Green Federated Learning in Communication-Constrained Environments
por: Colybes, Elouan, et al.
Publicado: (2026)
por: Colybes, Elouan, et al.
Publicado: (2026)
TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices
por: Li, Zonghang, et al.
Publicado: (2024)
por: Li, Zonghang, et al.
Publicado: (2024)
Learning In Chaos: Efficient Autoscaling and Self-Healing for Multi-Party Distributed Training
por: Feng, Wenjiao, et al.
Publicado: (2025)
por: Feng, Wenjiao, et al.
Publicado: (2025)
Agentic Compilation: Mitigating the LLM Rerun Crisis for Minimized-Inference-Cost Web Automation
por: Chundru, Jagadeesh
Publicado: (2026)
por: Chundru, Jagadeesh
Publicado: (2026)
Federated Learning Model Aggregation in Heterogenous Aerial and Space Networks
por: Dong, Fan, et al.
Publicado: (2023)
por: Dong, Fan, et al.
Publicado: (2023)
Near-Optimal Sparse Allreduce for Distributed Deep Learning
por: Li, Shigang, et al.
Publicado: (2022)
por: Li, Shigang, et al.
Publicado: (2022)
FlashSparse: Minimizing Computation Redundancy for Fast Sparse Matrix Multiplications on Tensor Cores
por: Shi, Jinliang, et al.
Publicado: (2024)
por: Shi, Jinliang, et al.
Publicado: (2024)
HFedATM: Hierarchical Federated Domain Generalization via Optimal Transport and Regularized Mean Aggregation
por: Nguyen, Thinh, et al.
Publicado: (2025)
por: Nguyen, Thinh, et al.
Publicado: (2025)
TokenCake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications
por: Bian, Zhuohang, et al.
Publicado: (2025)
por: Bian, Zhuohang, et al.
Publicado: (2025)
Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing
por: Gebrekidan, Tesfay Zemuy, et al.
Publicado: (2024)
por: Gebrekidan, Tesfay Zemuy, et al.
Publicado: (2024)
Scheduling the Unschedulable: Taming Black-Box LLM Inference at Scale
por: Yuan, Renzhong, et al.
Publicado: (2026)
por: Yuan, Renzhong, et al.
Publicado: (2026)
Adaptive GPU Resource Allocation for Multi-Agent Collaborative Reasoning in Serverless Environments
por: Zhang, Guilin, et al.
Publicado: (2025)
por: Zhang, Guilin, et al.
Publicado: (2025)
A Framework for the Interoperability of Cloud Platforms: Towards FAIR Data in SAFE Environments
por: Grossman, Robert L., et al.
Publicado: (2022)
por: Grossman, Robert L., et al.
Publicado: (2022)
Uncertainty Estimation in Multi-Agent Distributed Learning for AI-Enabled Edge Devices
por: Radchenko, Gleb, et al.
Publicado: (2024)
por: Radchenko, Gleb, et al.
Publicado: (2024)
Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation
por: Radchenko, Gleb, et al.
Publicado: (2024)
por: Radchenko, Gleb, et al.
Publicado: (2024)
SPEAR++: Scaling Gradient Inversion via Sparsely-Used Dictionary Learning
por: Bakarsky, Alexander, et al.
Publicado: (2025)
por: Bakarsky, Alexander, et al.
Publicado: (2025)
Ejemplares similares
-
Prima.cpp: Fast 30-70B LLM Inference on Heterogeneous and Low-Resource Home Clusters
por: Li, Zonghang, et al.
Publicado: (2025) -
FedPLT: Scalable, Resource-Efficient, and Heterogeneity-Aware Federated Learning via Partial Layer Training
por: Dabaja, Ahmad, et al.
Publicado: (2026) -
Aergia: Leveraging Heterogeneity in Federated Learning Systems
por: Cox, Bart, et al.
Publicado: (2022) -
Towards Optimal Heterogeneous Client Sampling in Multi-Model Federated Learning
por: Zhang, Haoran, et al.
Publicado: (2025) -
Connecting Large Language Model Agent to High Performance Computing Resource
por: Ma, Heng, et al.
Publicado: (2025)