Saved in:
| Main Authors: | Sharma, Pragya, Qiu, Hang, Srivastava, Mani |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.00005 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Smaller, Smarter, Closer: The Edge of Collaborative Generative AI
by: Morabito, Roberto, et al.
Published: (2025)
by: Morabito, Roberto, et al.
Published: (2025)
Edge-First Language Model Inference: Models, Metrics, and Tradeoffs
by: Jang, SiYoung, et al.
Published: (2025)
by: Jang, SiYoung, et al.
Published: (2025)
Early-Exit meets Model-Distributed Inference at Edge Networks
by: Colocrese, Marco, et al.
Published: (2024)
by: Colocrese, Marco, et al.
Published: (2024)
Trust-Aware Routing for Distributed Generative AI Inference at the Edge
by: Nguyen, Chanh, et al.
Published: (2026)
by: Nguyen, Chanh, et al.
Published: (2026)
HALO: Semantic-Aware Distributed LLM Inference in Lossy Edge Network
by: Zheng, Peirong, et al.
Published: (2026)
by: Zheng, Peirong, et al.
Published: (2026)
Optimizing Resource Allocation for Geographically-Distributed Inference by Large Language Models
by: Sun, Tingyang, et al.
Published: (2025)
by: Sun, Tingyang, et al.
Published: (2025)
SpaceMoE: Realizing Distributed Mixture-of-Experts Inference over Space Networks
by: Wang, Zhanwei, et al.
Published: (2026)
by: Wang, Zhanwei, et al.
Published: (2026)
$Λ$-Split: A Privacy-Preserving Split Computing Framework for Cloud-Powered Generative AI
by: Ohta, Shoki, et al.
Published: (2023)
by: Ohta, Shoki, et al.
Published: (2023)
NebulaFL: Effective Asynchronous Federated Learning for JointCloud Computing
by: Gao, Fei, et al.
Published: (2024)
by: Gao, Fei, et al.
Published: (2024)
An Open API Architecture to Discover the Trustworthy Explanation of Cloud AI Services
by: Wang, Zerui, et al.
Published: (2024)
by: Wang, Zerui, et al.
Published: (2024)
Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference
by: Ye, Shengyuan, et al.
Published: (2024)
by: Ye, Shengyuan, et al.
Published: (2024)
AI Greenferencing: Routing AI Inferencing to Green Modular Data Centers with Heron
by: Reddy, Tella Rajashekhar, et al.
Published: (2025)
by: Reddy, Tella Rajashekhar, et al.
Published: (2025)
Jupiter: Fast and Resource-Efficient Collaborative Inference of Generative LLMs on Edge Devices
by: Ye, Shengyuan, et al.
Published: (2025)
by: Ye, Shengyuan, et al.
Published: (2025)
Dynamic and Distributed Routing in IoT Networks based on Multi-Objective Q-Learning
by: Vaishnav, Shubham, et al.
Published: (2025)
by: Vaishnav, Shubham, et al.
Published: (2025)
Cluster Topology-Driven Placement of Experts Reduces Network Traffic in MoE Inference
by: Sivtsov, Danil, et al.
Published: (2025)
by: Sivtsov, Danil, et al.
Published: (2025)
XWind: A Cross-site Router for Large Language Model Inference Serving at Renewable Energy Farms
by: Reddy, Tella Rajashekhar, et al.
Published: (2026)
by: Reddy, Tella Rajashekhar, et al.
Published: (2026)
Adaptive DNN Partitioning and Offloading in Heterogeneous Edge-Cloud Continuum
by: Deng, Akuen Akoi, et al.
Published: (2026)
by: Deng, Akuen Akoi, et al.
Published: (2026)
Design and Optimization of Hierarchical Gradient Coding for Distributed Learning at Edge Devices
by: Tang, Weiheng, et al.
Published: (2024)
by: Tang, Weiheng, et al.
Published: (2024)
Rina: Enhancing Ring-AllReduce with In-network Aggregation in Distributed Model Training
by: Chen, Zixuan, et al.
Published: (2024)
by: Chen, Zixuan, et al.
Published: (2024)
Resilient by Design -- Active Inference for Distributed Continuum Intelligence
by: Donta, Praveen Kumar, et al.
Published: (2025)
by: Donta, Praveen Kumar, et al.
Published: (2025)
Enabling Intelligent Vehicular Networks Through Distributed Learning in the Non-Terrestrial Networks 6G Vision
by: Naseh, David, et al.
Published: (2023)
by: Naseh, David, et al.
Published: (2023)
FogROS2-FT: Fault Tolerant Cloud Robotics
by: Chen, Kaiyuan, et al.
Published: (2024)
by: Chen, Kaiyuan, et al.
Published: (2024)
Real-time and Downtime-tolerant Fault Diagnosis for Railway Turnout Machines (RTMs) Empowered with Cloud-Edge Pipeline Parallelism
by: Wu, Fan, et al.
Published: (2024)
by: Wu, Fan, et al.
Published: (2024)
Benchmarking Dynamic SLO Compliance in Distributed Computing Continuum Systems
by: Lapkovskis, Alfreds, et al.
Published: (2025)
by: Lapkovskis, Alfreds, et al.
Published: (2025)
SlimCaching: Edge Caching of Mixture-of-Experts for Distributed Inference
by: Chen, Qian, et al.
Published: (2025)
by: Chen, Qian, et al.
Published: (2025)
UCCL-EP: Portable Expert-Parallel Communication
by: Mao, Ziming, et al.
Published: (2025)
by: Mao, Ziming, et al.
Published: (2025)
Cooperation and Personalization on a Seesaw: Choice-based FL for Safe Cooperation in Wireless Networks
by: Zhang, Han, et al.
Published: (2024)
by: Zhang, Han, et al.
Published: (2024)
Vertical Federated Learning for Failure-Cause Identification in Disaggregated Microwave Networks
by: Temiz, Fatih, et al.
Published: (2025)
by: Temiz, Fatih, et al.
Published: (2025)
Implementation of Big AI Models for Wireless Networks with Collaborative Edge Computing
by: Zeng, Liekang, et al.
Published: (2024)
by: Zeng, Liekang, et al.
Published: (2024)
Fedstellar: A Platform for Decentralized Federated Learning
by: Beltrán, Enrique Tomás Martínez, et al.
Published: (2023)
by: Beltrán, Enrique Tomás Martínez, et al.
Published: (2023)
LEO-Split: A Semi-Supervised Split Learning Framework over LEO Satellite Networks
by: Lin, Zheng, et al.
Published: (2025)
by: Lin, Zheng, et al.
Published: (2025)
Federated Continual Learning for Edge-AI: A Comprehensive Survey
by: Wang, Zi, et al.
Published: (2024)
by: Wang, Zi, et al.
Published: (2024)
ProFe: Communication-Efficient Decentralized Federated Learning via Distillation and Prototypes
by: Sánchez, Pedro Miguel Sánchez, et al.
Published: (2024)
by: Sánchez, Pedro Miguel Sánchez, et al.
Published: (2024)
Communication Optimization for Decentralized Learning atop Bandwidth-limited Edge Networks
by: Sun, Tingyang, et al.
Published: (2025)
by: Sun, Tingyang, et al.
Published: (2025)
Resource-Efficient Personal Large Language Models Fine-Tuning with Collaborative Edge Computing
by: Ye, Shengyuan, et al.
Published: (2024)
by: Ye, Shengyuan, et al.
Published: (2024)
Microservice Deployment in Space Computing Power Networks via Robust Reinforcement Learning
by: Yu, Zhiyong, et al.
Published: (2025)
by: Yu, Zhiyong, et al.
Published: (2025)
Hierarchical Split Federated Learning: Convergence Analysis and System Optimization
by: Lin, Zheng, et al.
Published: (2024)
by: Lin, Zheng, et al.
Published: (2024)
Edge-device Collaborative Computing for Multi-view Classification
by: Palena, Marco, et al.
Published: (2024)
by: Palena, Marco, et al.
Published: (2024)
Adaptive Rank Allocation for Federated Parameter-Efficient Fine-Tuning of Language Models
by: Wu, Fei, et al.
Published: (2025)
by: Wu, Fei, et al.
Published: (2025)
ORIENT: A Priority-Aware Energy-Efficient Approach for Latency-Sensitive Applications in 6G
by: Shokrnezhad, Masoud, et al.
Published: (2024)
by: Shokrnezhad, Masoud, et al.
Published: (2024)
Similar Items
-
Smaller, Smarter, Closer: The Edge of Collaborative Generative AI
by: Morabito, Roberto, et al.
Published: (2025) -
Edge-First Language Model Inference: Models, Metrics, and Tradeoffs
by: Jang, SiYoung, et al.
Published: (2025) -
Early-Exit meets Model-Distributed Inference at Edge Networks
by: Colocrese, Marco, et al.
Published: (2024) -
Trust-Aware Routing for Distributed Generative AI Inference at the Edge
by: Nguyen, Chanh, et al.
Published: (2026) -
HALO: Semantic-Aware Distributed LLM Inference in Lossy Edge Network
by: Zheng, Peirong, et al.
Published: (2026)