:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gravara, Milos, Herrera, Juan Luis, Nastic, Stefan
Format:	Preprint
Published:	2026
Subjects:	Distributed, Parallel, and Cluster Computing Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2603.20821
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ProbSelect: Stochastic Client Selection for GPU-Accelerated Compute Devices in the 3D Continuum
by: Stanisic, Andrija, et al.
Published: (2025)

Truffle: Efficient Data Passing for Data-Intensive Serverless Workflows in the Edge-Cloud Continuum
by: Marcelino, Cynthia, et al.
Published: (2024)

Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows
by: Yang, Yuting, et al.
Published: (2024)

Orchestrating Agents and Data for Enterprise: A Blueprint Architecture for Compound AI
by: Kandogan, Eser, et al.
Published: (2025)

From Legacy Fortran to Portable Kokkos: An Autonomous Agentic AI Workflow
by: Gupta, Sparsh, et al.
Published: (2025)

Making Room for AI: Multi-GPU Molecular Dynamics with Deep Potentials in GROMACS
by: Pennati, Luca, et al.
Published: (2026)

Hybrid Learning and Optimization-Based Dynamic Scheduling for DL Workloads on Heterogeneous GPU Clusters
by: Dongare, Shruti, et al.
Published: (2025)

Accelerating MoE Model Inference with Expert Sharding
by: Balmau, Oana, et al.
Published: (2025)

Deploying Atmospheric and Oceanic AI Models on Chinese Hardware and Framework: Migration Strategies, Performance Optimization and Analysis
by: Sun, Yuze, et al.
Published: (2025)

Preventing Rank Collapse in Federated Low-Rank Adaptation with Client Heterogeneity
by: Wu, Fei, et al.
Published: (2026)

Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients
by: Koo, Jabin, et al.
Published: (2024)

FedGreen: Carbon-aware Federated Learning with Model Size Adaptation
by: Abbasi, Ali, et al.
Published: (2024)

Enhancing Large-Scale AI Training Efficiency: The C4 Solution for Real-Time Anomaly Detection and Communication Optimization
by: Dong, Jianbo, et al.
Published: (2024)

SmartMem: Layout Transformation Elimination and Adaptation for Efficient DNN Execution on Mobile
by: Niu, Wei, et al.
Published: (2024)

CoLLM: Continuous Adaptation for SLO-Aware LLM Serving on Shared GPU Clusters
by: Huang, Shaoyuan, et al.
Published: (2026)

Backpropagation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration
by: Ji, Wei, et al.
Published: (2024)

Databelt: A Continuous Data Path for Serverless Workflows in the 3D Compute Continuum
by: Marcelino, Cynthia, et al.
Published: (2025)

Tackling the Dynamicity in a Production LLM Serving System with SOTA Optimizations via Hybrid Prefill/Decode/Verify Scheduling on Efficient Meta-kernels
by: Song, Mingcong, et al.
Published: (2024)

FedFusion: Federated Learning with Diversity- and Cluster-Aware Encoders for Robust Adaptation under Label Scarcity
by: Kahenga, Ferdinand, et al.
Published: (2025)

Cosmos: A Cost Model for Serverless Workflows in the 3D Compute Continuum
by: Marcelino, Cynthia, et al.
Published: (2025)

Action Engine: Automatic Workflow Generation in FaaS
by: Esashi, Akiharu, et al.
Published: (2024)

Efficient and Scalable Agentic AI with Heterogeneous Systems
by: Asgar, Zain, et al.
Published: (2025)

Gradient Correction in Federated Learning with Adaptive Optimization
by: Chen, Evan, et al.
Published: (2025)

CommunityAI: Towards Community-based Federated Learning
by: Murturi, Ilir, et al.
Published: (2023)

Efficient Chromosome Parallelization for Precision Medicine Genomic Workflows
by: Montserrat, Daniel Mas, et al.
Published: (2025)

Scalable AI-assisted Workflow Management for Detector Design Optimization Using Distributed Computing
by: Anderson, Derek, et al.
Published: (2026)

Optimizing Federated Learning by Entropy-Based Client Selection
by: Lutz, Andreas, et al.
Published: (2024)

Distributed Low-Communication Training with Decoupled Momentum Optimization
by: Nedelkoski, Sasho, et al.
Published: (2025)

Mind the Gap: Revealing Inconsistencies Across Heterogeneous AI Accelerators
by: Wen, Elliott, et al.
Published: (2025)

FedSQ: Optimized Weight Averaging via Fixed Gating
by: Pérez-Corral, Cristian, et al.
Published: (2026)

PipeOffload: Improving Scalability of Pipeline Parallelism with Memory Optimization
by: Wan, Xinyi, et al.
Published: (2025)

A Survey on Inference Optimization Techniques for Mixture of Experts Models
by: Liu, Jiacheng, et al.
Published: (2024)

Optimizing video analytics inference pipelines: a case study
by: Ghafouri, Saeid, et al.
Published: (2025)

AIMeter: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads
by: Huang, Hongzhen, et al.
Published: (2025)

Deep Reinforcement Learning for Optimizing Energy Consumption in Smart Grid Systems
by: Alsheikhi, Abeer, et al.
Published: (2026)

AIConfigurator: Lightning-Fast Configuration Optimization for Multi-Framework LLM Serving
by: Xu, Tianhao, et al.
Published: (2026)

Atomix: Timely, Transactional Tool Use for Reliable Agentic Workflows
by: Mohammadi, Bardia, et al.
Published: (2026)

UNIFERENCE: A Discrete Event Simulation Framework for Developing Distributed AI Models
by: Eldenk, Doğaç, et al.
Published: (2026)

Streamlining in the Riemannian Realm: Efficient Riemannian Optimization with Loopless Variance Reduction
by: Demidovich, Yury, et al.
Published: (2024)

Intelligent Resource Allocation Optimization for Cloud Computing via Machine Learning
by: Wang, Yuqing, et al.
Published: (2025)