:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Rahman, Asif, Cvetkovic, Veljko, Reece, Kathleen, Walters, Aidan, Hassan, Yasir, Tummeti, Aneesh, Torres, Bryan, Cooney, Denise, Ellis, Margaret, Nikolopoulos, Dimitrios S.
Format:	Preprint
Published:	2025
Subjects:	Distributed, Parallel, and Cluster Computing Machine Learning Software Engineering
Online Access:	https://arxiv.org/abs/2505.03906
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FACT: Compositional Kernel Synthesis with a Three-Stage Agentic Workflow
by: Heidari, Sina, et al.
Published: (2026)

Taming the Memory Footprint Crisis: System Design for Production Diffusion LLM Serving
by: Fan, Jiakun, et al.
Published: (2025)

APEX: Asynchronous Parallel CPU-GPU Execution for Online LLM Inference on Constrained GPUs
by: Fan, Jiakun, et al.
Published: (2025)

Modality Inflation: Energy Characterization and Optimization Opportunities for MLLM Inference
by: Moghadampanah, Mona, et al.
Published: (2025)

Task queue implementation for edge computing platform
by: Maksimovic, Veljko, et al.
Published: (2024)

Configuration management in the distributed cloud
by: Ranković, Tamara, et al.
Published: (2024)

QPART: Adaptive Model Quantization and Dynamic Workload Balancing for Accuracy-aware Edge Inference
by: Li, Xiangchen, et al.
Published: (2025)

ParvaGPU: Efficient Spatial GPU Sharing for Large-Scale DNN Inference in Cloud Environments
by: Lee, Munkyu, et al.
Published: (2024)

Where is the Testbed for my Federated Learning Research?
by: Božič, Janez, et al.
Published: (2024)

Dirigent: Lightweight Serverless Orchestration
by: Cvetković, Lazar, et al.
Published: (2024)

Autonomous Electrochemistry Platform with Real-Time Normality Testing of Voltammetry Measurements Using ML
by: Al-Najjar, Anees, et al.
Published: (2025)

Towards a real-time distributed feedback system for the transportation assistance of PwD
by: Polenakis, Iosif, et al.
Published: (2024)

Melding the Serverless Control Plane with the Conventional Cluster Manager for Speed and Resource Efficiency
by: Kondrashov, Leonid, et al.
Published: (2025)

CG-Kit: Code Generation Toolkit for Performant and Maintainable Variants of Source Code Applied to Flash-X Hydrodynamics Simulations
by: Rudi, Johann, et al.
Published: (2024)

SHEATH: Defending Horizontal Collaboration for Distributed CNNs against Adversarial Noise
by: Asif, Muneeba, et al.
Published: (2024)

Trustworthy Scheduling for Big Data Applications
by: Tomaras, Dimitrios, et al.
Published: (2026)

TIMBER: On supporting data pipelines in Mobile Cloud Environments
by: Tomaras, Dimitrios, et al.
Published: (2024)

Fused Breadth-First Probabilistic Traversals on Distributed GPU Systems
by: Neff, Reece, et al.
Published: (2023)

Picasso: Memory-Efficient Graph Coloring Using Palettes With Applications in Quantum Computing
by: Ferdous, S M, et al.
Published: (2024)

Leveraging Public Cloud Infrastructure for Real-time Connected Vehicle Speed Advisory at a Signalized Corridor
by: Deng, Hsien-Wen, et al.
Published: (2024)

Generalized Data Placement Strategies for Racetrack Memories
by: Khan, Asif Ali, et al.
Published: (2019)

Prediction-driven resource provisioning for serverless container runtimes
by: Tomaras, Dimitrios, et al.
Published: (2024)

ConfigSpec: Profiling-Based Configuration Selection for Distributed Edge--Cloud Speculative LLM Serving
by: Li, Xiangchen, et al.
Published: (2026)

SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving
by: Li, Xiangchen, et al.
Published: (2025)

Unlocking True Elasticity for the Cloud-Native Era with Dandelion
by: Kuchler, Tom, et al.
Published: (2025)

Low-Latency Privacy-Preserving Deep Learning Design via Secure MPC
by: Lin, Ke, et al.
Published: (2024)

Optimizing CNN Using HPC Tools
by: Rahman, Shahrin
Published: (2024)

NaturalTurn: A Method to Segment Speech into Psychologically Meaningful Conversational Turns
by: Cooney, Gus, et al.
Published: (2024)

SuperSFL: Resource-Heterogeneous Federated Split Learning with Weight-Sharing Super-Networks
by: Asif, Abdullah Al, et al.
Published: (2026)

Leveraging Core and Uncore Frequency Scaling for Power-Efficient Serverless Workflows
by: Tzenetopoulos, Achilleas, et al.
Published: (2024)

Future Mining: Learning for Safety and Security
by: Rahman, Md Sazedur, et al.
Published: (2026)

Ichnos: A Carbon Footprint Estimator for Scientific Workflows
by: West, Kathleen, et al.
Published: (2024)

Stream-K Optimization and Exploration
by: Rackley, Nick, et al.
Published: (2024)

A Survey on Scheduling Techniques in the Edge Cloud: Issues, Challenges and Future Directions
by: Asghar, Hassan, et al.
Published: (2022)

CurvFed: Curvature-Aligned Federated Learning for Fairness without Demographics
by: Sharma, Harshit, et al.
Published: (2024)

Enabling Scalability in Asynchronous and Bidirectional Communication in LPWAN
by: Rahman, Mahbubur
Published: (2025)

HeLoCo: Efficient asynchronous low-communication training under data and device heterogeneity
by: Asif, Abdullah Al, et al.
Published: (2026)

Cooperative Gradient Coding
by: Weng, Shudi, et al.
Published: (2025)

SmartPQ: An Adaptive Concurrent Priority Queue for NUMA Architectures
by: Giannoula, Christina, et al.
Published: (2024)

Code once, Run Green: Automated Green Code Translation in Serverless Computing
by: Werner, Sebastian, et al.
Published: (2025)