Saved in:
| Main Authors: | Chang, Yunhan, Magdy, Amr, Spedalieri, Federico M. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.12608 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ContiguousKV: Accelerating LLM Prefill with Granularity-Aligned KV Cache Management
by: Zou, Jing, et al.
Published: (2026)
by: Zou, Jing, et al.
Published: (2026)
Overcoming Memory Constraints in Quantum Circuit Simulation with a High-Fidelity Compression Framework
by: Zhang, Boyuan, et al.
Published: (2024)
by: Zhang, Boyuan, et al.
Published: (2024)
Mosaic: Towards Efficient Training of Multimodal Models with Spatial Resource Multiplexing
by: Wang, Yanbo, et al.
Published: (2026)
by: Wang, Yanbo, et al.
Published: (2026)
Incidence Constraints in Hypergraph Partitioning on GPU
by: Ronzani, Marco, et al.
Published: (2026)
by: Ronzani, Marco, et al.
Published: (2026)
Embedded Distributed Inference of Deep Neural Networks: A Systematic Review
by: Peccia, Federico Nicolás, et al.
Published: (2024)
by: Peccia, Federico Nicolás, et al.
Published: (2024)
Carbon-Aware Workflow Scheduling with Fixed Mapping and Deadline Constraint
by: Schweisgut, Dominik, et al.
Published: (2025)
by: Schweisgut, Dominik, et al.
Published: (2025)
Distributed Optimisation with Linear Equality and Inequality Constraints using PDMM
by: Heusdens, Richard, et al.
Published: (2023)
by: Heusdens, Richard, et al.
Published: (2023)
Green by Design: Constraint-Based Adaptive Deployment in the Cloud Continuum
by: D'Iapico, Andrea, et al.
Published: (2026)
by: D'Iapico, Andrea, et al.
Published: (2026)
Hypergraph Partitioning on GPU with Distinct Incident Hyperedges and Size Constraints
by: Ronzani, Marco, et al.
Published: (2026)
by: Ronzani, Marco, et al.
Published: (2026)
Monte Cimone v3: Where RISC-V Stands in High-Performance Computing
by: Venieri, Emanuele, et al.
Published: (2026)
by: Venieri, Emanuele, et al.
Published: (2026)
AIGC-assisted Federated Learning for Vehicular Edge Intelligence: Vehicle Selection, Resource Allocation and Model Augmentation
by: Qiang, Xianke, et al.
Published: (2025)
by: Qiang, Xianke, et al.
Published: (2025)
Beyond Pre-Training: The Full Lifecycle of Foundation Models on HPC Systems
by: Conciatore, Dino, et al.
Published: (2026)
by: Conciatore, Dino, et al.
Published: (2026)
OpenFLAME: A Federated Spatial Naming Infrastructure
by: Bharadwaj, Sagar, et al.
Published: (2024)
by: Bharadwaj, Sagar, et al.
Published: (2024)
Priority Matters: Optimising Kubernetes Clusters Usage with Constraint-Based Pod Packing
by: Christensen, Henrik Daniel, et al.
Published: (2025)
by: Christensen, Henrik Daniel, et al.
Published: (2025)
GPU Acceleration for Faster Evolutionary Spatial Cyclic Game Systems
by: Sinadjan, Louie
Published: (2025)
by: Sinadjan, Louie
Published: (2025)
REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments
by: Desai, Humaid Ahmed, et al.
Published: (2023)
by: Desai, Humaid Ahmed, et al.
Published: (2023)
On the Operational Resilience of CBDC: Threats and Prospects of Formal Validation for Offline Payments
by: Bernardo, Marco, et al.
Published: (2025)
by: Bernardo, Marco, et al.
Published: (2025)
Boosting LLM Serving through Spatial-Temporal GPU Resource Sharing
by: Lin, Zejia, et al.
Published: (2025)
by: Lin, Zejia, et al.
Published: (2025)
OServe: Accelerating LLM Serving via Spatial-Temporal Workload Orchestration
by: Jiang, Youhe, et al.
Published: (2026)
by: Jiang, Youhe, et al.
Published: (2026)
MuxServe: Flexible Spatial-Temporal Multiplexing for Multiple LLM Serving
by: Duan, Jiangfei, et al.
Published: (2024)
by: Duan, Jiangfei, et al.
Published: (2024)
QoE-oriented Dependent Task Scheduling under Multi-dimensional QoS Constraints over Distributed Networks
by: Fan, Xuwei, et al.
Published: (2023)
by: Fan, Xuwei, et al.
Published: (2023)
ESTA: An Efficient Spatial-Temporal Range Aggregation Query Processing Algorithm for UAV Networks
by: Liu, Liang, et al.
Published: (2023)
by: Liu, Liang, et al.
Published: (2023)
CarbonEdge: Leveraging Mesoscale Spatial Carbon-Intensity Variations for Low Carbon Edge Computing
by: Wu, Li, et al.
Published: (2025)
by: Wu, Li, et al.
Published: (2025)
ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in Cloud Environments
by: Lee, Munkyu, et al.
Published: (2024)
by: Lee, Munkyu, et al.
Published: (2024)
ExeGPT: Constraint-Aware Resource Scheduling for LLM Inference
by: Oh, Hyungjun, et al.
Published: (2024)
by: Oh, Hyungjun, et al.
Published: (2024)
Eva: Cost-Efficient Cloud-Based Cluster Scheduling
by: Chang, Tzu-Tao, et al.
Published: (2025)
by: Chang, Tzu-Tao, et al.
Published: (2025)
Large-scale Neural Network Quantum States for ab initio Quantum Chemistry Simulations on Fugaku
by: Xu, Hongtao, et al.
Published: (2025)
by: Xu, Hongtao, et al.
Published: (2025)
MoA-Off: Adaptive Heterogeneous Modality-Aware Offloading with Edge-Cloud Collaboration for Efficient Multimodal LLM Inference
by: Yang, Zheming, et al.
Published: (2025)
by: Yang, Zheming, et al.
Published: (2025)
Picasso: Memory-Efficient Graph Coloring Using Palettes With Applications in Quantum Computing
by: Ferdous, S M, et al.
Published: (2024)
by: Ferdous, S M, et al.
Published: (2024)
Optimization of Hybrid Quantum-Classical Algorithms
by: Remme, Lian, et al.
Published: (2025)
by: Remme, Lian, et al.
Published: (2025)
Zeppelin: Balancing Variable-length Workloads in Data Parallel Large Model Training
by: Chen, Chang, et al.
Published: (2025)
by: Chen, Chang, et al.
Published: (2025)
CFP: Efficient Optimization of Intra-Operator Parallelism Plans for Large Model Training
by: Hu, Weifang, et al.
Published: (2025)
by: Hu, Weifang, et al.
Published: (2025)
Empowering the Quantum Cloud User with QRIO
by: Chakraborty, Shmeelok, et al.
Published: (2024)
by: Chakraborty, Shmeelok, et al.
Published: (2024)
Semantic-aware Token Selection and Resource Optimization for Communication-efficient Split Federated Fine-tuning in Edge Intelligence
by: Qiang, Xianke, et al.
Published: (2026)
by: Qiang, Xianke, et al.
Published: (2026)
MuxTune: Efficient Multi-Task LLM Fine-Tuning in Multi-Tenant Datacenters via Spatial-Temporal Backbone Multiplexing
by: Xue, Chunyu, et al.
Published: (2026)
by: Xue, Chunyu, et al.
Published: (2026)
Beyond 2-Edge-Connectivity: Algorithms and Impossibility for Content-Oblivious Leader Election
by: Chang, Yi-Jun, et al.
Published: (2025)
by: Chang, Yi-Jun, et al.
Published: (2025)
FlexKV: Flexible Index Offloading for Memory-Disaggregated Key-Value Store
by: Hu, Zhisheng, et al.
Published: (2025)
by: Hu, Zhisheng, et al.
Published: (2025)
SLURM Heterogeneous Jobs for Hybrid Classical-Quantum Workflows
by: Esposito, Aniello, et al.
Published: (2025)
by: Esposito, Aniello, et al.
Published: (2025)
The Markovianity of Time: The Category Mistake in Open Quantum Systems
by: Borrill, Paul
Published: (2026)
by: Borrill, Paul
Published: (2026)
FedPBS: Proximal-Balanced Scaling Federated Learning Model for Robust Personalized Training for Non-IID Data
by: AbouNassar, Eman M., et al.
Published: (2026)
by: AbouNassar, Eman M., et al.
Published: (2026)
Similar Items
-
ContiguousKV: Accelerating LLM Prefill with Granularity-Aligned KV Cache Management
by: Zou, Jing, et al.
Published: (2026) -
Overcoming Memory Constraints in Quantum Circuit Simulation with a High-Fidelity Compression Framework
by: Zhang, Boyuan, et al.
Published: (2024) -
Mosaic: Towards Efficient Training of Multimodal Models with Spatial Resource Multiplexing
by: Wang, Yanbo, et al.
Published: (2026) -
Incidence Constraints in Hypergraph Partitioning on GPU
by: Ronzani, Marco, et al.
Published: (2026) -
Embedded Distributed Inference of Deep Neural Networks: A Systematic Review
by: Peccia, Federico Nicolás, et al.
Published: (2024)