Saved in:
| Main Authors: | Dai, Jun, Wang, Xiaorun, Fang, Kexiong, Yang, Zheng, Ji, Yuefeng, Zhang, Jiawei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.12064 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MatchRDMA: A Segmented and Rate-Matched Long-Haul RDMA Scheme for Geo-distributed LLM Training over OTN
by: Dai, Jun, et al.
Published: (2026)
by: Dai, Jun, et al.
Published: (2026)
LCMP: Distributed Long-Haul Cost-Aware Multi-Path Routing for Inter-Datacenter RDMA Networks
by: Yu, Dong-Yang, et al.
Published: (2026)
by: Yu, Dong-Yang, et al.
Published: (2026)
CollaPipe: Adaptive Segment-Optimized Pipeline Parallelism for Collaborative LLM Training in Heterogeneous Edge Networks
by: Chen, Jiewei, et al.
Published: (2025)
by: Chen, Jiewei, et al.
Published: (2025)
RoCE BALBOA: Service-enhanced Data Center RDMA for SmartNICs
by: Heer, Maximilian Jakob, et al.
Published: (2025)
by: Heer, Maximilian Jakob, et al.
Published: (2025)
SDR-RDMA: Software-Defined Reliability Architecture for Planetary Scale RDMA Communication
by: Khalilov, Mikhail, et al.
Published: (2025)
by: Khalilov, Mikhail, et al.
Published: (2025)
Designing Transport-Level Encryption for Datacenter Networks
by: Gao, Tianyi, et al.
Published: (2024)
by: Gao, Tianyi, et al.
Published: (2024)
SIRD: A Sender-Informed, Receiver-Driven Datacenter Transport Protocol
by: Prasopoulos, Konstantinos, et al.
Published: (2023)
by: Prasopoulos, Konstantinos, et al.
Published: (2023)
Faster Offloads by Unloading them -- The RDMA Case
by: Fragkouli, Georgia, et al.
Published: (2025)
by: Fragkouli, Georgia, et al.
Published: (2025)
Swift: Rethinking RDMA Control Plane for Elastic Computing
by: Zhang, Junxue, et al.
Published: (2025)
by: Zhang, Junxue, et al.
Published: (2025)
SHIFT: Exploring the Boundary of RDMA Network Fault Tolerance
by: Lin, Shengkai, et al.
Published: (2025)
by: Lin, Shengkai, et al.
Published: (2025)
Photonic Rails in ML Datacenters
by: Ding, Eric, et al.
Published: (2025)
by: Ding, Eric, et al.
Published: (2025)
FedRDMA: Communication-Efficient Cross-Silo Federated LLM via Chunked RDMA Transmission
by: Zhang, Zeling, et al.
Published: (2024)
by: Zhang, Zeling, et al.
Published: (2024)
Photonic Rails in ML Datacenters with Opus
by: Ding, Eric, et al.
Published: (2026)
by: Ding, Eric, et al.
Published: (2026)
RNG: Flat Datacenter Networks at Scale
by: Bernardi, Giacomo, et al.
Published: (2026)
by: Bernardi, Giacomo, et al.
Published: (2026)
CBA: Communication-Bound-Aware Cross-Domain Resource Assignment for Pipeline-Parallel Distributed LLM Training in Dynamic Multi-DC Optical Networks
by: Fu, Dianxuan, et al.
Published: (2025)
by: Fu, Dianxuan, et al.
Published: (2025)
Orderly Management of Packets in RDMA by Eunomia
by: Mahmood, Sana, et al.
Published: (2024)
by: Mahmood, Sana, et al.
Published: (2024)
UB-Mesh: a Hierarchically Localized nD-FullMesh Datacenter Network Architecture
by: Liao, Heng, et al.
Published: (2025)
by: Liao, Heng, et al.
Published: (2025)
Understanding the Throughput Bounds of Reconfigurable Datacenter Networks
by: Addanki, Vamsi, et al.
Published: (2024)
by: Addanki, Vamsi, et al.
Published: (2024)
D3: An Adaptive Reconfigurable Datacenter Network
by: Zerwas, Johannes, et al.
Published: (2024)
by: Zerwas, Johannes, et al.
Published: (2024)
SDT: Cutting Datacenter Tax Through Simultaneous Data-Delivery Threads
by: Mamandipoor, Amin, et al.
Published: (2025)
by: Mamandipoor, Amin, et al.
Published: (2025)
Reimagining RDMA Through the Lens of ML
by: Warraich, Ertza, et al.
Published: (2025)
by: Warraich, Ertza, et al.
Published: (2025)
Scheduling Parallel Optical Circuit Switches for AI Training
by: Liang, Kevin, et al.
Published: (2026)
by: Liang, Kevin, et al.
Published: (2026)
Palladium: A DPU-enabled Multi-Tenant Serverless Cloud over Zero-copy Multi-node RDMA Fabrics
by: Qi, Shixiong, et al.
Published: (2025)
by: Qi, Shixiong, et al.
Published: (2025)
An End-to-End Assurance Framework for AI/ML Workloads in Datacenters
by: Gupta, Jit, et al.
Published: (2025)
by: Gupta, Jit, et al.
Published: (2025)
Avoiding Cross-Datacenter Collective Congestion via Disaggregated Buffering
by: Scazzariello, Mariano, et al.
Published: (2026)
by: Scazzariello, Mariano, et al.
Published: (2026)
Poster: Flexible Scheduling of Network and Computing Resources for Distributed AI Tasks
by: Wang, Ruikun, et al.
Published: (2024)
by: Wang, Ruikun, et al.
Published: (2024)
Multi-Failure Localization in High-Degree ROADM-based Optical Networks using Rules-Informed Neural Networks
by: Wang, Ruikun, et al.
Published: (2025)
by: Wang, Ruikun, et al.
Published: (2025)
Noisy Neighbor: Exploiting RDMA for Resource Exhaustion Attacks in Containerized Clouds
by: Kim, Gunwoo, et al.
Published: (2025)
by: Kim, Gunwoo, et al.
Published: (2025)
Bring Your Own Objective: Inter-operability of Network Objectives in Datacenters
by: Narang, Sanjoli, et al.
Published: (2026)
by: Narang, Sanjoli, et al.
Published: (2026)
Flow Optimization at Inter-Datacenter Networks for Application Run-time Acceleration
by: Serracanta, Berta, et al.
Published: (2024)
by: Serracanta, Berta, et al.
Published: (2024)
BShare: Packet Queueing Delay-Driven Buffer Sharing for Datacenter Switches
by: Agarwal, Krishna, et al.
Published: (2026)
by: Agarwal, Krishna, et al.
Published: (2026)
Trusted Repeater Placement in QKD-enabled Optical Networks
by: Marik, Arup Kumar, et al.
Published: (2025)
by: Marik, Arup Kumar, et al.
Published: (2025)
Validation of a Software-Defined 100-Gb/s RDMA Streaming Architecture for Ultrafast Optoacoustic and Ultrasound Imaging
by: Villani, Federico, et al.
Published: (2026)
by: Villani, Federico, et al.
Published: (2026)
Securing High-Performance Data Transfers: Implementing AES Encryption in RDMA Systems
by: Bångsbo, Erik, et al.
Published: (2026)
by: Bångsbo, Erik, et al.
Published: (2026)
Zeropod: Simplifying Datacenter Networking with Future-Proof Zero-Buffer Packet Switches
by: Liang, Cong, et al.
Published: (2021)
by: Liang, Cong, et al.
Published: (2021)
Elevating the future of mobility: UAV-enabled Intelligent Transportation Systems
by: Saboor, Abdul, et al.
Published: (2021)
by: Saboor, Abdul, et al.
Published: (2021)
Varuna: Enabling Failure-Type Aware RDMA Failover
by: Wang, Xiaoyang, et al.
Published: (2026)
by: Wang, Xiaoyang, et al.
Published: (2026)
Learning-based Two-tiered Online Optimization of Region-wide Datacenter Resource Allocation
by: Chen, Chang-Lin, et al.
Published: (2023)
by: Chen, Chang-Lin, et al.
Published: (2023)
NegotiaToR: Towards A Simple Yet Effective On-demand Reconfigurable Datacenter Network
by: Liang, Cong, et al.
Published: (2024)
by: Liang, Cong, et al.
Published: (2024)
InfiniteHBD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers
by: Shou, Chenchen, et al.
Published: (2025)
by: Shou, Chenchen, et al.
Published: (2025)
Similar Items
-
MatchRDMA: A Segmented and Rate-Matched Long-Haul RDMA Scheme for Geo-distributed LLM Training over OTN
by: Dai, Jun, et al.
Published: (2026) -
LCMP: Distributed Long-Haul Cost-Aware Multi-Path Routing for Inter-Datacenter RDMA Networks
by: Yu, Dong-Yang, et al.
Published: (2026) -
CollaPipe: Adaptive Segment-Optimized Pipeline Parallelism for Collaborative LLM Training in Heterogeneous Edge Networks
by: Chen, Jiewei, et al.
Published: (2025) -
RoCE BALBOA: Service-enhanced Data Center RDMA for SmartNICs
by: Heer, Maximilian Jakob, et al.
Published: (2025) -
SDR-RDMA: Software-Defined Reliability Architecture for Planetary Scale RDMA Communication
by: Khalilov, Mikhail, et al.
Published: (2025)