Saved in:
| Main Authors: | Varol, Aygün, Motlagh, Naser Hossein, Leino, Mirka, Tarkoma, Sasu, Virkki, Johanna |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.14708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CloudSim 7G: An Integrated Toolkit for Modeling and Simulation of Future Generation Cloud Computing Environments
by: Andreoli, Remo, et al.
Published: (2024)
by: Andreoli, Remo, et al.
Published: (2024)
MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Quantization
by: Wang, Zongwu, et al.
Published: (2025)
by: Wang, Zongwu, et al.
Published: (2025)
Flash-SD-KDE: Accelerating SD-KDE with Tensor Cores
by: Epstein, Elliot L., et al.
Published: (2026)
by: Epstein, Elliot L., et al.
Published: (2026)
Follow-Me AI: Energy-Efficient User Interaction with Smart Environments
by: Saleh, Alaa, et al.
Published: (2024)
by: Saleh, Alaa, et al.
Published: (2024)
Scaling Point-based Differentiable Rendering for Large-scale Reconstruction
by: Zhao, Hexu, et al.
Published: (2025)
by: Zhao, Hexu, et al.
Published: (2025)
Splitwise: Efficient generative LLM inference using phase splitting
by: Patel, Pratyush, et al.
Published: (2023)
by: Patel, Pratyush, et al.
Published: (2023)
Neural Router: Semantic Content Matching for Agentic AI
by: Lovén, Lauri, et al.
Published: (2026)
by: Lovén, Lauri, et al.
Published: (2026)
Why does Prediction Accuracy Decrease over Time? Uncertain Positive Learning for Cloud Failure Prediction
by: Li, Haozhe, et al.
Published: (2024)
by: Li, Haozhe, et al.
Published: (2024)
Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference
by: Stojkovic, Jovan, et al.
Published: (2024)
by: Stojkovic, Jovan, et al.
Published: (2024)
Towards Message Brokers for Generative AI: Survey, Challenges, and Opportunities
by: Saleh, Alaa, et al.
Published: (2023)
by: Saleh, Alaa, et al.
Published: (2023)
Smart Space Environments: Key Challenges and Innovative Solutions
by: Kumar, Ramakant
Published: (2024)
by: Kumar, Ramakant
Published: (2024)
UserCentrix: An Agentic Memory-augmented AI Framework for Smart Spaces
by: Saleh, Alaa, et al.
Published: (2025)
by: Saleh, Alaa, et al.
Published: (2025)
On the Effectiveness of the 'Follow-the-Sun' Strategy in Mitigating the Carbon Footprint of AI in Cloud Instances
by: Vergallo, Roberto, et al.
Published: (2025)
by: Vergallo, Roberto, et al.
Published: (2025)
PANDORA: A Parallel Dendrogram Construction Algorithm for Single Linkage Clustering on GPU
by: Sao, Piyush, et al.
Published: (2024)
by: Sao, Piyush, et al.
Published: (2024)
Energy-Efficient Split Learning for Resource-Constrained Environments: A Smart Farming Solution
by: Soltani, Keiwan, et al.
Published: (2025)
by: Soltani, Keiwan, et al.
Published: (2025)
Roadmap for Edge AI: A Dagstuhl Perspective
by: Ding, Aaron Yi, et al.
Published: (2021)
by: Ding, Aaron Yi, et al.
Published: (2021)
Representation Similarity: A Better Guidance of DNN Layer Sharing for Edge Computing without Training
by: Cao, Bryan Bo, et al.
Published: (2024)
by: Cao, Bryan Bo, et al.
Published: (2024)
Scalable Machine Learning Training Infrastructure for Online Ads Recommendation and Auction Scoring Modeling at Google
by: Kurian, George, et al.
Published: (2025)
by: Kurian, George, et al.
Published: (2025)
A Survey on Model-heterogeneous Federated Learning: Problems, Methods, and Prospects
by: Fan, Boyu, et al.
Published: (2023)
by: Fan, Boyu, et al.
Published: (2023)
MLP-Offload: Multi-Level, Multi-Path Offloading for LLM Pre-training to Break the GPU Memory Wall
by: Maurya, Avinash, et al.
Published: (2025)
by: Maurya, Avinash, et al.
Published: (2025)
Cross-Platform Fused MoE Dispatch in Triton: Portable Expert Routing Without CUDA
by: Mitra, Subhadip
Published: (2026)
by: Mitra, Subhadip
Published: (2026)
Bio-inspired Agentic Self-healing Framework for Resilient Distributed Computing Continuum Systems
by: Saleh, Alaa, et al.
Published: (2026)
by: Saleh, Alaa, et al.
Published: (2026)
Trustworthy Second-hand Marketplace for Built Environment
by: Wilson, Stanly, et al.
Published: (2025)
by: Wilson, Stanly, et al.
Published: (2025)
A trustless society? A political look at the blockchain vision
by: Rehak, Rainer
Published: (2024)
by: Rehak, Rainer
Published: (2024)
A Selective Homomorphic Encryption Approach for Faster Privacy-Preserving Federated Learning
by: Korkmaz, Abdulkadir, et al.
Published: (2025)
by: Korkmaz, Abdulkadir, et al.
Published: (2025)
GREEN-CODE: Learning to Optimize Energy Efficiency in LLM-based Code Generation
by: Ilager, Shashikant, et al.
Published: (2025)
by: Ilager, Shashikant, et al.
Published: (2025)
A reliability- and latency-driven task allocation framework for workflow applications in the edge-hub-cloud continuum
by: Kouloumpris, Andreas, et al.
Published: (2026)
by: Kouloumpris, Andreas, et al.
Published: (2026)
Enabling SSI-Compliant Use of EUDI Wallet Credentials through Trusted Execution Environment and Zero-Knowledge Proof
by: Sitouah, Nacereddine, et al.
Published: (2026)
by: Sitouah, Nacereddine, et al.
Published: (2026)
Efficient Construction of Large Search Spaces for Auto-Tuning
by: Willemsen, Floris-Jan, et al.
Published: (2025)
by: Willemsen, Floris-Jan, et al.
Published: (2025)
Distributed Simulation of Large Multi-body Systems
by: Kale, Manas, et al.
Published: (2024)
by: Kale, Manas, et al.
Published: (2024)
ATLAHS: An Application-centric Network Simulator Toolchain for AI, HPC, and Distributed Storage
by: Shen, Siyuan, et al.
Published: (2025)
by: Shen, Siyuan, et al.
Published: (2025)
Federated Domain Generalization with Data-free On-server Matching Gradient
by: Nguyen, Trong-Binh, et al.
Published: (2025)
by: Nguyen, Trong-Binh, et al.
Published: (2025)
Planetary computing for data-driven environmental policy-making
by: Ferris, Patrick, et al.
Published: (2023)
by: Ferris, Patrick, et al.
Published: (2023)
Comparison of Autoscaling Frameworks for Containerised Machine-Learning-Applications in a Local and Cloud Environment
by: Schroeder, Christian, et al.
Published: (2023)
by: Schroeder, Christian, et al.
Published: (2023)
Deep Learning Model Deployment in Multiple Cloud Providers: an Exploratory Study Using Low Computing Power Environments
by: Lemos, Elayne, et al.
Published: (2025)
by: Lemos, Elayne, et al.
Published: (2025)
Harnessing Data Spaces to Build Intelligent Smart City Infrastructures Across the Cloud-Edge Continuum
by: Amaxilatis, Dimitrios, et al.
Published: (2025)
by: Amaxilatis, Dimitrios, et al.
Published: (2025)
AMP4EC: Adaptive Model Partitioning Framework for Efficient Deep Learning Inference in Edge Computing Environments
by: Zhang, Guilin, et al.
Published: (2025)
by: Zhang, Guilin, et al.
Published: (2025)
An HPC Benchmark Survey and Taxonomy for Characterization
by: Herten, Andreas, et al.
Published: (2025)
by: Herten, Andreas, et al.
Published: (2025)
HybridFlow: A Flexible and Efficient RLHF Framework
by: Sheng, Guangming, et al.
Published: (2024)
by: Sheng, Guangming, et al.
Published: (2024)
Stream-K++: Adaptive GPU GEMM Kernel Scheduling and Selection using Bloom Filters
by: Sadasivan, Harisankar, et al.
Published: (2024)
by: Sadasivan, Harisankar, et al.
Published: (2024)
Similar Items
-
CloudSim 7G: An Integrated Toolkit for Modeling and Simulation of Future Generation Cloud Computing Environments
by: Andreoli, Remo, et al.
Published: (2024) -
MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Quantization
by: Wang, Zongwu, et al.
Published: (2025) -
Flash-SD-KDE: Accelerating SD-KDE with Tensor Cores
by: Epstein, Elliot L., et al.
Published: (2026) -
Follow-Me AI: Energy-Efficient User Interaction with Smart Environments
by: Saleh, Alaa, et al.
Published: (2024) -
Scaling Point-based Differentiable Rendering for Large-scale Reconstruction
by: Zhao, Hexu, et al.
Published: (2025)