| Main Authors: | Khalilov, Mikhail; Shen, Siyuan; Chrapek, Marcin; Chen, Tiancheng; Nakano, Kenji; Gootzen, Peter-Jan; Di Girolamo, Salvatore; Nudelman, Rami; Bloch, Gil; Anantharamu, Sreevatsa; Elhaddad, Mahmoud; Jose, Jithin; Kabbani, Abdul; Moe, Scott; Taranov, Konstantin; Yu, Zhuolong; Zhang, Jie; Mazzoletti, Nicola; Hoefler, Torsten |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.05366 |
Similar Items
Network-Offloaded Bandwidth-Optimal Broadcast and Allgather for Distributed AI
by: Khalilov, Mikhail, et al.
Published: (2024)
A Non‐Dissipative, Energy‐Conserving, Arbitrary High‐Order Numerical Method and Its Efficient Implementation for Incompressible Flow Simulation in Complex Geometries
by: Sreevatsa Anantharamu, et al.
Published: (2024)
Uno: A One-Stop Solution for Inter- and Intra-Datacenter Congestion Control and Reliable Connectivity
by: Bonato, Tommaso, et al.
Published: (2025)
Understanding Data Movement in Tightly Coupled Heterogeneous Systems: A Case Study with the Grace Hopper Superchip
by: Fusco, Luigi, et al.
Published: (2024)
Hazel: Secure and Efficient Disaggregated Storage
by: Chrapek, Marcin, et al.
Published: (2025)
OSMOSIS: Enabling Multi-Tenancy in Datacenter SmartNICs
by: Khalilov, Mikhail, et al.
Published: (2023)
FedRDMA: Communication-Efficient Cross-Silo Federated LLM via Chunked RDMA Transmission
by: Zhang, Zeling, et al.
Published: (2024)
Reimagining RDMA Through the Lens of ML
by: Warraich, Ertza, et al.
Published: (2025)
Orderly Management of Packets in RDMA by Eunomia
by: Mahmood, Sana, et al.
Published: (2024)
Confidential LLM Inference: Performance and Cost Across CPU and GPU TEEs
by: Chrapek, Marcin, et al.
Published: (2025)
Specifying and Verifying RDMA Synchronisation (Extended Version)
by: Ambal, Guillaume, et al.
Published: (2026)
ALock: Asymmetric Lock Primitive for RDMA Systems
by: Baran, Amanda, et al.
Published: (2024)
Faster Offloads by Unloading them -- The RDMA Case
by: Fragkouli, Georgia, et al.
Published: (2025)
MatchRDMA: A Segmented and Rate-Matched Long-Haul RDMA Scheme for Geo-distributed LLM Training over OTN
by: Dai, Jun, et al.
Published: (2026)
FaaSKeeper: Learning from Building Serverless Services with ZooKeeper as an Example
by: Copik, Marcin, et al.
Published: (2022)
Towards Efficient and Scalable Distributed Vector Search with RDMA
by: Zhi, Xiangyu, et al.
Published: (2025)
Swift: Rethinking RDMA Control Plane for Elastic Computing
by: Zhang, Junxue, et al.
Published: (2025)
Thallus: An RDMA-based Columnar Data Transport Protocol
by: Chakraborty, Jayjeet, et al.
Published: (2024)
RDMA-Based Algorithms for Sparse Matrix Multiplication on GPUs
by: Brock, Benjamin, et al.
Published: (2023)
SHIFT: Exploring the Boundary of RDMA Network Fault Tolerance
by: Lin, Shengkai, et al.
Published: (2025)
Varuna: Enabling Failure-Type Aware RDMA Failover
by: Wang, Xiaoyang, et al.
Published: (2026)
Software Resource Disaggregation for HPC with Serverless Computing
by: Copik, Marcin, et al.
Published: (2024)
Validation of a Software-Defined 100-Gb/s RDMA Streaming Architecture for Ultrafast Optoacoustic and Ultrasound Imaging
by: Villani, Federico, et al.
Published: (2026)
EDAN: Towards Understanding Memory Parallelism and Latency Sensitivity in HPC
by: Shen, Siyuan, et al.
Published: (2025)
An RDMA-First Object Storage System with SmartNIC Offload
by: Zhu, Yu, et al.
Published: (2025)
fabric-lib: RDMA Point-to-Point Communication for LLM Systems
by: Licker, Nandor, et al.
Published: (2025)
The Semantic Arrow of Time, Part III: RDMA and the Completion Fallacy
by: Borrill, Paul
Published: (2026)
Handling of Memory Page Faults during Virtual-Address RDMA
by: Psistakis, Antonis
Published: (2025)
Noisy Neighbor: Exploiting RDMA for Resource Exhaustion Attacks in Containerized Clouds
by: Kim, Gunwoo, et al.
Published: (2025)
Securing High-Performance Data Transfers: Implementing AES Encryption in RDMA Systems
by: Bångsbo, Erik, et al.
Published: (2026)
Closing the HPC-Cloud Convergence Gap: Multi-Tenant Slingshot RDMA for Kubernetes
by: Friese, Philipp A., et al.
Published: (2025)
RoCE BALBOA: Service-enhanced Data Center RDMA for SmartNICs
by: Heer, Maximilian Jakob, et al.
Published: (2025)
Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud
by: Chrapek, Marcin, et al.
Published: (2024)
OptiNIC: A Resilient and Tail-Optimal RDMA NIC for Distributed ML Workloads
by: Warraich, Ertza, et al.
Published: (2025)
Benchmarking Filtered Approximate Nearest Neighbor Search Algorithms on Transformer-based Embedding Vectors
by: Iff, Patrick, et al.
Published: (2025)
RDMA: Cost Effective Agent-Driven Rare Disease Mining from Electronic Health Records
by: Wu, John, et al.
Published: (2025)
FPGA-Based RoCEv2-RDMA Readout Electronics for the CTAO-LST Advanced Camera
by: Marini, F., et al.
Published: (2025)
LCMP: Distributed Long-Haul Cost-Aware Multi-Path Routing for Inter-Datacenter RDMA Networks
by: Yu, Dong-Yang, et al.
Published: (2026)
REPS: Recycled Entropy Packet Spraying for Adaptive Load Balancing and Failure Mitigation
by: Bonato, Tommaso, et al.
Published: (2024)
OnePiece: A Large-Scale Distributed Inference System with RDMA for Complex AI-Generated Content (AIGC) Workflows
by: Chen, June, et al.
Published: (2026)