:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chang, Yunhan, Magdy, Amr, Spedalieri, Federico M.
Format:	Preprint
Published:	2025
Subjects:	Distributed, Parallel, and Cluster Computing
Online Access:	https://arxiv.org/abs/2505.12608
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ContiguousKV: Accelerating LLM Prefill with Granularity-Aligned KV Cache Management
by: Zou, Jing, et al.
Published: (2026)

Overcoming Memory Constraints in Quantum Circuit Simulation with a High-Fidelity Compression Framework
by: Zhang, Boyuan, et al.
Published: (2024)

Mosaic: Towards Efficient Training of Multimodal Models with Spatial Resource Multiplexing
by: Wang, Yanbo, et al.
Published: (2026)

Incidence Constraints in Hypergraph Partitioning on GPU
by: Ronzani, Marco, et al.
Published: (2026)

Embedded Distributed Inference of Deep Neural Networks: A Systematic Review
by: Peccia, Federico Nicolás, et al.
Published: (2024)

Carbon-Aware Workflow Scheduling with Fixed Mapping and Deadline Constraint
by: Schweisgut, Dominik, et al.
Published: (2025)

Distributed Optimisation with Linear Equality and Inequality Constraints using PDMM
by: Heusdens, Richard, et al.
Published: (2023)

Green by Design: Constraint-Based Adaptive Deployment in the Cloud Continuum
by: D'Iapico, Andrea, et al.
Published: (2026)

Hypergraph Partitioning on GPU with Distinct Incident Hyperedges and Size Constraints
by: Ronzani, Marco, et al.
Published: (2026)

Monte Cimone v3: Where RISC-V Stands in High-Performance Computing
by: Venieri, Emanuele, et al.
Published: (2026)

AIGC-assisted Federated Learning for Vehicular Edge Intelligence: Vehicle Selection, Resource Allocation and Model Augmentation
by: Qiang, Xianke, et al.
Published: (2025)

Beyond Pre-Training: The Full Lifecycle of Foundation Models on HPC Systems
by: Conciatore, Dino, et al.
Published: (2026)

OpenFLAME: A Federated Spatial Naming Infrastructure
by: Bharadwaj, Sagar, et al.
Published: (2024)

Priority Matters: Optimising Kubernetes Clusters Usage with Constraint-Based Pod Packing
by: Christensen, Henrik Daniel, et al.
Published: (2025)

GPU Acceleration for Faster Evolutionary Spatial Cyclic Game Systems
by: Sinadjan, Louie
Published: (2025)

REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments
by: Desai, Humaid Ahmed, et al.
Published: (2023)

On the Operational Resilience of CBDC: Threats and Prospects of Formal Validation for Offline Payments
by: Bernardo, Marco, et al.
Published: (2025)

Boosting LLM Serving through Spatial-Temporal GPU Resource Sharing
by: Lin, Zejia, et al.
Published: (2025)

OServe: Accelerating LLM Serving via Spatial-Temporal Workload Orchestration
by: Jiang, Youhe, et al.
Published: (2026)

MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving
by: Duan, Jiangfei, et al.
Published: (2024)

QoE-oriented Dependent Task Scheduling under Multi-dimensional QoS Constraints over Distributed Networks
by: Fan, Xuwei, et al.
Published: (2023)

ESTA: An Efficient Spatial-Temporal Range Aggregation Query Processing Algorithm for UAV Networks
by: Liu, Liang, et al.
Published: (2023)

CarbonEdge: Leveraging Mesoscale Spatial Carbon-Intensity Variations for Low Carbon Edge Computing
by: Wu, Li, et al.
Published: (2025)

ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in Cloud Environments
by: Lee, Munkyu, et al.
Published: (2024)

ExeGPT: Constraint-Aware Resource Scheduling for LLM Inference
by: Oh, Hyungjun, et al.
Published: (2024)

Eva: Cost-Efficient Cloud-Based Cluster Scheduling
by: Chang, Tzu-Tao, et al.
Published: (2025)

Large-scale Neural Network Quantum States for ab initio Quantum Chemistry Simulations on Fugaku
by: Xu, Hongtao, et al.
Published: (2025)

MoA-Off: Adaptive Heterogeneous Modality-Aware Offloading with Edge-Cloud Collaboration for Efficient Multimodal LLM Inference
by: Yang, Zheming, et al.
Published: (2025)

Picasso: Memory-Efficient Graph Coloring Using Palettes With Applications in Quantum Computing
by: Ferdous, S M, et al.
Published: (2024)

Optimization of Hybrid Quantum-Classical Algorithms
by: Remme, Lian, et al.
Published: (2025)

Zeppelin: Balancing Variable-length Workloads in Data Parallel Large Model Training
by: Chen, Chang, et al.
Published: (2025)

CFP: Efficient Optimization of Intra-Operator Parallelism Plans for Large Model Training
by: Hu, Weifang, et al.
Published: (2025)

Empowering the Quantum Cloud User with QRIO
by: Chakraborty, Shmeelok, et al.
Published: (2024)

Semantic-aware Token Selection and Resource Optimization for Communication-efficient Split Federated Fine-tuning in Edge Intelligence
by: Qiang, Xianke, et al.
Published: (2026)

MuxTune: Efficient Multi-Task LLM Fine-Tuning in Multi-Tenant Datacenters via Spatial-Temporal Backbone Multiplexing
by: Xue, Chunyu, et al.
Published: (2026)

Beyond 2-Edge-Connectivity: Algorithms and Impossibility for Content-Oblivious Leader Election
by: Chang, Yi-Jun, et al.
Published: (2025)

FlexKV: Flexible Index Offloading for Memory-Disaggregated Key-Value Store
by: Hu, Zhisheng, et al.
Published: (2025)

SLURM Heterogeneous Jobs for Hybrid Classical-Quantum Workflows
by: Esposito, Aniello, et al.
Published: (2025)

The Markovianity of Time: The Category Mistake in Open Quantum Systems
by: Borrill, Paul
Published: (2026)

FedPBS: Proximal-Balanced Scaling Federated Learning Model for Robust Personalized Training for Non-IID Data
by: AbouNassar, Eman M., et al.
Published: (2026)