Saved in:
| Main Authors: | Rahman, Asif, Cvetkovic, Veljko, Reece, Kathleen, Walters, Aidan, Hassan, Yasir, Tummeti, Aneesh, Torres, Bryan, Cooney, Denise, Ellis, Margaret, Nikolopoulos, Dimitrios S. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.03906 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
by: Heidari, Sina, et al.
Published: (2026)
by: Heidari, Sina, et al.
Published: (2026)
Taming the Memory Footprint Crisis: System Design for Production Diffusion LLM Serving
by: Fan, Jiakun, et al.
Published: (2025)
by: Fan, Jiakun, et al.
Published: (2025)
APEX: Asynchronous Parallel CPU-GPU Execution for Online LLM Inference on Constrained GPUs
by: Fan, Jiakun, et al.
Published: (2025)
by: Fan, Jiakun, et al.
Published: (2025)
Modality Inflation: Energy Characterization and Optimization Opportunities for MLLM Inference
by: Moghadampanah, Mona, et al.
Published: (2025)
by: Moghadampanah, Mona, et al.
Published: (2025)
Task queue implementation for edge computing platform
by: Maksimovic, Veljko, et al.
Published: (2024)
by: Maksimovic, Veljko, et al.
Published: (2024)
Configuration management in the distributed cloud
by: Ranković, Tamara, et al.
Published: (2024)
by: Ranković, Tamara, et al.
Published: (2024)
QPART: Adaptive Model Quantization and Dynamic Workload Balancing for Accuracy-aware Edge Inference
by: Li, Xiangchen, et al.
Published: (2025)
by: Li, Xiangchen, et al.
Published: (2025)
ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in Cloud Environments
by: Lee, Munkyu, et al.
Published: (2024)
by: Lee, Munkyu, et al.
Published: (2024)
Where is the Testbed for my Federated Learning Research?
by: Božič, Janez, et al.
Published: (2024)
by: Božič, Janez, et al.
Published: (2024)
Dirigent: Lightweight Serverless Orchestration
by: Cvetković, Lazar, et al.
Published: (2024)
by: Cvetković, Lazar, et al.
Published: (2024)
Autonomous Electrochemistry Platform with Real-Time Normality Testing of Voltammetry Measurements Using ML
by: Al-Najjar, Anees, et al.
Published: (2025)
by: Al-Najjar, Anees, et al.
Published: (2025)
Towards a real-time distributed feedback system for the transportation assistance of PwD
by: Polenakis, Iosif, et al.
Published: (2024)
by: Polenakis, Iosif, et al.
Published: (2024)
Melding the Serverless Control Plane with the Conventional Cluster Manager for Speed and Resource Efficiency
by: Kondrashov, Leonid, et al.
Published: (2025)
by: Kondrashov, Leonid, et al.
Published: (2025)
CG-Kit: Code Generation Toolkit for Performant and Maintainable Variants of Source Code Applied to Flash-X Hydrodynamics Simulations
by: Rudi, Johann, et al.
Published: (2024)
by: Rudi, Johann, et al.
Published: (2024)
SHEATH: Defending Horizontal Collaboration for Distributed CNNs against Adversarial Noise
by: Asif, Muneeba, et al.
Published: (2024)
by: Asif, Muneeba, et al.
Published: (2024)
Trustworthy Scheduling for Big Data Applications
by: Tomaras, Dimitrios, et al.
Published: (2026)
by: Tomaras, Dimitrios, et al.
Published: (2026)
TIMBER: On supporting data pipelines in Mobile Cloud Environments
by: Tomaras, Dimitrios, et al.
Published: (2024)
by: Tomaras, Dimitrios, et al.
Published: (2024)
Fused Breadth-First Probabilistic Traversals on Distributed GPU Systems
by: Neff, Reece, et al.
Published: (2023)
by: Neff, Reece, et al.
Published: (2023)
Picasso: Memory-Efficient Graph Coloring Using Palettes With Applications in Quantum Computing
by: Ferdous, S M, et al.
Published: (2024)
by: Ferdous, S M, et al.
Published: (2024)
Leveraging Public Cloud Infrastructure for Real-time Connected Vehicle Speed Advisory at a Signalized Corridor
by: Deng, Hsien-Wen, et al.
Published: (2024)
by: Deng, Hsien-Wen, et al.
Published: (2024)
Generalized Data Placement Strategies for Racetrack Memories
by: Khan, Asif Ali, et al.
Published: (2019)
by: Khan, Asif Ali, et al.
Published: (2019)
Prediction-driven resource provisioning for serverless container runtimes
by: Tomaras, Dimitrios, et al.
Published: (2024)
by: Tomaras, Dimitrios, et al.
Published: (2024)
ConfigSpec: Profiling-Based Configuration Selection for Distributed Edge--Cloud Speculative LLM Serving
by: Li, Xiangchen, et al.
Published: (2026)
by: Li, Xiangchen, et al.
Published: (2026)
SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving
by: Li, Xiangchen, et al.
Published: (2025)
by: Li, Xiangchen, et al.
Published: (2025)
Unlocking True Elasticity for the Cloud-Native Era with Dandelion
by: Kuchler, Tom, et al.
Published: (2025)
by: Kuchler, Tom, et al.
Published: (2025)
Low-Latency Privacy-Preserving Deep Learning Design via Secure MPC
by: Lin, Ke, et al.
Published: (2024)
by: Lin, Ke, et al.
Published: (2024)
Optimizing CNN Using HPC Tools
by: Rahman, Shahrin
Published: (2024)
by: Rahman, Shahrin
Published: (2024)
NaturalTurn: A Method to Segment Speech into Psychologically Meaningful Conversational Turns
by: Cooney, Gus, et al.
Published: (2024)
by: Cooney, Gus, et al.
Published: (2024)
SuperSFL: Resource-Heterogeneous Federated Split Learning with Weight-Sharing Super-Networks
by: Asif, Abdullah Al, et al.
Published: (2026)
by: Asif, Abdullah Al, et al.
Published: (2026)
Leveraging Core and Uncore Frequency Scaling for Power-Efficient Serverless Workflows
by: Tzenetopoulos, Achilleas, et al.
Published: (2024)
by: Tzenetopoulos, Achilleas, et al.
Published: (2024)
Future Mining: Learning for Safety and Security
by: Rahman, Md Sazedur, et al.
Published: (2026)
by: Rahman, Md Sazedur, et al.
Published: (2026)
Ichnos: A Carbon Footprint Estimator for Scientific Workflows
by: West, Kathleen, et al.
Published: (2024)
by: West, Kathleen, et al.
Published: (2024)
Stream-K Optimization and Exploration
by: Rackley, Nick, et al.
Published: (2024)
by: Rackley, Nick, et al.
Published: (2024)
A Survey on Scheduling Techniques in the Edge Cloud: Issues, Challenges and Future Directions
by: Asghar, Hassan, et al.
Published: (2022)
by: Asghar, Hassan, et al.
Published: (2022)
CurvFed: Curvature-Aligned Federated Learning for Fairness without Demographics
by: Sharma, Harshit, et al.
Published: (2024)
by: Sharma, Harshit, et al.
Published: (2024)
Enabling Scalability in Asynchronous and Bidirectional Communication in LPWAN
by: Rahman, Mahbubur
Published: (2025)
by: Rahman, Mahbubur
Published: (2025)
HeLoCo: Efficient asynchronous low-communication training under data and device heterogeneity
by: Asif, Abdullah Al, et al.
Published: (2026)
by: Asif, Abdullah Al, et al.
Published: (2026)
Cooperative Gradient Coding
by: Weng, Shudi, et al.
Published: (2025)
by: Weng, Shudi, et al.
Published: (2025)
SmartPQ: An Adaptive Concurrent Priority Queue for NUMA Architectures
by: Giannoula, Christina, et al.
Published: (2024)
by: Giannoula, Christina, et al.
Published: (2024)
Code once, Run Green: Automated Green Code Translation in Serverless Computing
by: Werner, Sebastian, et al.
Published: (2025)
by: Werner, Sebastian, et al.
Published: (2025)
Similar Items
-
FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
by: Heidari, Sina, et al.
Published: (2026) -
Taming the Memory Footprint Crisis: System Design for Production Diffusion LLM Serving
by: Fan, Jiakun, et al.
Published: (2025) -
APEX: Asynchronous Parallel CPU-GPU Execution for Online LLM Inference on Constrained GPUs
by: Fan, Jiakun, et al.
Published: (2025) -
Modality Inflation: Energy Characterization and Optimization Opportunities for MLLM Inference
by: Moghadampanah, Mona, et al.
Published: (2025) -
Task queue implementation for edge computing platform
by: Maksimovic, Veljko, et al.
Published: (2024)