Saved in:
| Main Authors: | Gravara, Milos, Herrera, Juan Luis, Nastic, Stefan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.20821 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ProbSelect: Stochastic Client Selection for GPU-Accelerated Compute Devices in the 3D Continuum
by: Stanisic, Andrija, et al.
Published: (2025)
by: Stanisic, Andrija, et al.
Published: (2025)
Truffle: Efficient Data Passing for Data-Intensive Serverless Workflows in the Edge-Cloud Continuum
by: Marcelino, Cynthia, et al.
Published: (2024)
by: Marcelino, Cynthia, et al.
Published: (2024)
Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows
by: Yang, Yuting, et al.
Published: (2024)
by: Yang, Yuting, et al.
Published: (2024)
Orchestrating Agents and Data for Enterprise: A Blueprint Architecture for Compound AI
by: Kandogan, Eser, et al.
Published: (2025)
by: Kandogan, Eser, et al.
Published: (2025)
From Legacy Fortran to Portable Kokkos: An Autonomous Agentic AI Workflow
by: Gupta, Sparsh, et al.
Published: (2025)
by: Gupta, Sparsh, et al.
Published: (2025)
Making Room for AI: Multi-GPU Molecular Dynamics with Deep Potentials in GROMACS
by: Pennati, Luca, et al.
Published: (2026)
by: Pennati, Luca, et al.
Published: (2026)
Hybrid Learning and Optimization-Based Dynamic Scheduling for DL Workloads on Heterogeneous GPU Clusters
by: Dongare, Shruti, et al.
Published: (2025)
by: Dongare, Shruti, et al.
Published: (2025)
Accelerating MoE Model Inference with Expert Sharding
by: Balmau, Oana, et al.
Published: (2025)
by: Balmau, Oana, et al.
Published: (2025)
Deploying Atmospheric and Oceanic AI Models on Chinese Hardware and Framework: Migration Strategies, Performance Optimization and Analysis
by: Sun, Yuze, et al.
Published: (2025)
by: Sun, Yuze, et al.
Published: (2025)
Preventing Rank Collapse in Federated Low-Rank Adaptation with Client Heterogeneity
by: Wu, Fei, et al.
Published: (2026)
by: Wu, Fei, et al.
Published: (2026)
Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients
by: Koo, Jabin, et al.
Published: (2024)
by: Koo, Jabin, et al.
Published: (2024)
FedGreen: Carbon-aware Federated Learning with Model Size Adaptation
by: Abbasi, Ali, et al.
Published: (2024)
by: Abbasi, Ali, et al.
Published: (2024)
Enhancing Large-Scale AI Training Efficiency: The C4 Solution for Real-Time Anomaly Detection and Communication Optimization
by: Dong, Jianbo, et al.
Published: (2024)
by: Dong, Jianbo, et al.
Published: (2024)
SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile
by: Niu, Wei, et al.
Published: (2024)
by: Niu, Wei, et al.
Published: (2024)
CoLLM: Continuous Adaptation for SLO-Aware LLM Serving on Shared GPU Clusters
by: Huang, Shaoyuan, et al.
Published: (2026)
by: Huang, Shaoyuan, et al.
Published: (2026)
Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration
by: Ji, Wei, et al.
Published: (2024)
by: Ji, Wei, et al.
Published: (2024)
Databelt: A Continuous Data Path for Serverless Workflows in the 3D Compute Continuum
by: Marcelino, Cynthia, et al.
Published: (2025)
by: Marcelino, Cynthia, et al.
Published: (2025)
Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels
by: Song, Mingcong, et al.
Published: (2024)
by: Song, Mingcong, et al.
Published: (2024)
FedFusion: Federated Learning with Diversity- and Cluster-Aware Encoders for Robust Adaptation under Label Scarcity
by: Kahenga, Ferdinand, et al.
Published: (2025)
by: Kahenga, Ferdinand, et al.
Published: (2025)
Cosmos: A Cost Model for Serverless Workflows in the 3D Compute Continuum
by: Marcelino, Cynthia, et al.
Published: (2025)
by: Marcelino, Cynthia, et al.
Published: (2025)
Action Engine: Automatic Workflow Generation in FaaS
by: Esashi, Akiharu, et al.
Published: (2024)
by: Esashi, Akiharu, et al.
Published: (2024)
Efficient and Scalable Agentic AI with Heterogeneous Systems
by: Asgar, Zain, et al.
Published: (2025)
by: Asgar, Zain, et al.
Published: (2025)
Gradient Correction in Federated Learning with Adaptive Optimization
by: Chen, Evan, et al.
Published: (2025)
by: Chen, Evan, et al.
Published: (2025)
CommunityAI: Towards Community-based Federated Learning
by: Murturi, Ilir, et al.
Published: (2023)
by: Murturi, Ilir, et al.
Published: (2023)
Efficient Chromosome Parallelization for Precision Medicine Genomic Workflows
by: Montserrat, Daniel Mas, et al.
Published: (2025)
by: Montserrat, Daniel Mas, et al.
Published: (2025)
Scalable AI-assisted Workflow Management for Detector Design Optimization Using Distributed Computing
by: Anderson, Derek, et al.
Published: (2026)
by: Anderson, Derek, et al.
Published: (2026)
Optimizing Federated Learning by Entropy-Based Client Selection
by: Lutz, Andreas, et al.
Published: (2024)
by: Lutz, Andreas, et al.
Published: (2024)
Distributed Low-Communication Training with Decoupled Momentum Optimization
by: Nedelkoski, Sasho, et al.
Published: (2025)
by: Nedelkoski, Sasho, et al.
Published: (2025)
Mind the Gap: Revealing Inconsistencies Across Heterogeneous AI Accelerators
by: Wen, Elliott, et al.
Published: (2025)
by: Wen, Elliott, et al.
Published: (2025)
FedSQ: Optimized Weight Averaging via Fixed Gating
by: Pérez-Corral, Cristian, et al.
Published: (2026)
by: Pérez-Corral, Cristian, et al.
Published: (2026)
PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization
by: Wan, Xinyi, et al.
Published: (2025)
by: Wan, Xinyi, et al.
Published: (2025)
A Survey on Inference Optimization Techniques for Mixture of Experts Models
by: Liu, Jiacheng, et al.
Published: (2024)
by: Liu, Jiacheng, et al.
Published: (2024)
Optimizing video analytics inference pipelines: a case study
by: Ghafouri, Saeid, et al.
Published: (2025)
by: Ghafouri, Saeid, et al.
Published: (2025)
AIMeter: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads
by: Huang, Hongzhen, et al.
Published: (2025)
by: Huang, Hongzhen, et al.
Published: (2025)
Deep Reinforcement Learning for Optimizing Energy Consumption in Smart Grid Systems
by: Alsheikhi, Abeer, et al.
Published: (2026)
by: Alsheikhi, Abeer, et al.
Published: (2026)
AIConfigurator: Lightning-Fast Configuration Optimization for Multi-Framework LLM Serving
by: Xu, Tianhao, et al.
Published: (2026)
by: Xu, Tianhao, et al.
Published: (2026)
Atomix: Timely, Transactional Tool Use for Reliable Agentic Workflows
by: Mohammadi, Bardia, et al.
Published: (2026)
by: Mohammadi, Bardia, et al.
Published: (2026)
UNIFERENCE: A Discrete Event Simulation Framework for Developing Distributed AI Models
by: Eldenk, Doğaç, et al.
Published: (2026)
by: Eldenk, Doğaç, et al.
Published: (2026)
Streamlining in the Riemannian Realm: Efficient Riemannian Optimization with Loopless Variance Reduction
by: Demidovich, Yury, et al.
Published: (2024)
by: Demidovich, Yury, et al.
Published: (2024)
Intelligent Resource Allocation Optimization for Cloud Computing via Machine Learning
by: Wang, Yuqing, et al.
Published: (2025)
by: Wang, Yuqing, et al.
Published: (2025)
Similar Items
-
ProbSelect: Stochastic Client Selection for GPU-Accelerated Compute Devices in the 3D Continuum
by: Stanisic, Andrija, et al.
Published: (2025) -
Truffle: Efficient Data Passing for Data-Intensive Serverless Workflows in the Edge-Cloud Continuum
by: Marcelino, Cynthia, et al.
Published: (2024) -
Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows
by: Yang, Yuting, et al.
Published: (2024) -
Orchestrating Agents and Data for Enterprise: A Blueprint Architecture for Compound AI
by: Kandogan, Eser, et al.
Published: (2025) -
From Legacy Fortran to Portable Kokkos: An Autonomous Agentic AI Workflow
by: Gupta, Sparsh, et al.
Published: (2025)