| Main Author: | Barros, Sebastian |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.03708 |
Similar Items
Lightweight Latency Prediction Scheme for Edge Applications: A Rational Modelling Approach
by: Liyanage, Mohan, et al.
Published: (2025)
Short-circuiting Rings for Low-Latency AllReduce
by: Hammer, Sarah-Michelle, et al.
Published: (2025)
RouterWise: Joint Resource Allocation and Routing for Latency-Aware Multi-Model LLM Serving
by: Kasnavieh, Hossein Hosseini, et al.
Published: (2026)
Trivance: Latency-Optimal AllReduce by Shortcutting Multiport Networks
by: Juerss, Anton, et al.
Published: (2026)
Risk-Aware and Stable Edge Server Selection Under Network Latency SLOs
by: Liyanage, Mohan, et al.
Published: (2026)
Network Anomaly Detection in Distributed Edge Computing Infrastructure
by: Marfo, William, et al.
Published: (2025)
GENIO: Synergizing Edge Computing with Optical Network Infrastructures
by: Cesarano, Carmine, et al.
Published: (2025)
COREC: Concurrent Non-Blocking Single-Queue Receive Driver for Low Latency Networking
by: Faltelli, Marco, et al.
Published: (2024)
Uber's Failover Architecture: Reconciling Reliability and Efficiency in Hyperscale Microservice Infrastructure
by: Bansal, Mayank, et al.
Published: (2026)
CCL-Bench 1.0: A Trace-Based Benchmark for LLM Infrastructure
by: Ding, Eric, et al.
Published: (2026)
GORGO: Maximizing KV-Cache Reuse While Minimizing Network Latency in Cross-Region LLM Load Balancing
by: Toniolo, Alessio Ricci, et al.
Published: (2026)
1.5 Million Messages Per Second on 3 Machines: Benchmarking and Latency Optimization of Apache Pulsar at Enterprise Scale
by: Mukkolakkal, Muhamed Ramees Cheriya
Published: (2026)
Self-Healing Network of Interconnected Edge Devices Empowered by Infrastructure-as-Code and LoRa Communication
by: Carson, Rob, et al.
Published: (2025)
CRAFT: Latency and Cost-Aware Genetic-Based Framework for Node Placement in Edge-Fog Environments
by: Mahdizadeh, Soheil, et al.
Published: (2025)
Surviving the Edge: Federated Learning under Networking and Resource Constraints
by: Mwanje, Mike, et al.
Published: (2026)
EdgeLoc: A Communication-Adaptive Parallel System for Real-Time Localization in Infrastructure-Assisted Autonomous Driving
by: Liu, Boyi, et al.
Published: (2024)
FogROS2-PLR: Probabilistic Latency-Reliability For Cloud Robotics
by: Chen, Kaiyuan, et al.
Published: (2024)
Joint Partitioning and Placement of Foundation Models for Real-Time Edge AI
by: Djuhera, Aladin, et al.
Published: (2025)
Decentralized Stratified Sampling for Low-Latency Approximate Geospatial Data Stream Processing in Edge-Cloud Architectures
by: Jawarneh, Isam Mashhour Al, et al.
Published: (2026)
Reliable Image Transmission in CPS-based Pub/Sub
by: Flores, Everson, et al.
Published: (2025)
NET4EXA: Pioneering the Future of Interconnects for Supercomputing and AI
by: Martinelli, Michele, et al.
Published: (2026)
Dynamic Edge Server Selection in Time-Varying Environments: A Reliability-Aware Predictive Approach
by: Burbano, Jaime Sebastian, et al.
Published: (2025)
FlowTracer: A Tool for Uncovering Network Path Usage Imbalance in AI Training Clusters
by: Jamil, Hasibul, et al.
Published: (2024)
A Task Decomposition and Planning Framework for Efficient LLM Inference in AI-Enabled WiFi-Offload Networks
by: Han, Mingqi, et al.
Published: (2026)
SAKURAONE: An Open Ethernet-Based AI HPC System and Its Observed Workload Dynamics in a Single-Tenant LLM Development Environment
by: Konishi, Fumikazu, et al.
Published: (2026)
Optimizing Split Learning Latency in TinyML-Based IoT Systems
by: Jenhani, Zied, et al.
Published: (2025)
Diffusion Models on the Edge: Challenges, Optimizations, and Applications
by: Zheng, Dongqi
Published: (2025)
A New Broadcast Model for Several Network Topologies
by: Lu, Hongbo, et al.
Published: (2025)
Federated Learning and Evolutionary Game Model for Fog Federation Formation
by: Yasser, Zyad, et al.
Published: (2024)
Resource Allocation Driven by Large Models in Future Semantic-Aware Networks
by: Zhang, Haijun, et al.
Published: (2025)
To Stream or Not to Stream: Towards A Quantitative Model for Remote HPC Processing Decisions
by: Castro, Flavio, et al.
Published: (2025)
ncsim: A Lightweight Simulator for Networked Edge Computing with Wireless Interference Modeling
by: Krishnamachari, Bhaskar, et al.
Published: (2026)
Performance Analysis of Internet of Vehicles Mesh Networks Based on Actual Switch Models
by: Hu, Jialin, et al.
Published: (2024)
Toward Edge General Intelligence with Multiple-Large Language Model (Multi-LLM): Architecture, Trust, and Orchestration
by: Luo, Haoxiang, et al.
Published: (2025)
Low-Latency Video Conferencing via Optimized Packet Routing and Reordering
by: Xiao, Yao, et al.
Published: (2023)
Satellite Federated Fine-Tuning for Foundation Models in Space Computing Power Networks
by: Zhu, Yan, et al.
Published: (2025)
Over-the-Top Resource Broker System for Split Computing: An Approach to Distribute Cloud Computing Infrastructure
by: Friese, Ingo, et al.
Published: (2025)
A Uniqueness Theorem for Distributed Computation under Physical Constraint
by: Ren, Zhiyuan, et al.
Published: (2025)
Fast Multichannel Topology Discovery in Cognitive Radio Networks
by: Wang, Yung-Li, et al.
Published: (2025)
Enabling Scalability in Asynchronous and Bidirectional Communication in LPWAN
by: Rahman, Mahbubur
Published: (2025)