Remote Direct Memory Access (RDMA)
X-RDMA: Effective RDMA Middleware in Large-scale Production Environments
Alibaba
Focus on robustness, scalability, and maintainability.
FreeFlow: Software-based Virtual RDMA Networking for Containerized Clouds
CMU & Microsoft & Alibaba & ByteDance
A software-based RDMA virtualization framework designed for containerized clouds.
Revisiting Network Support for RDMA
UC Berkeley & ICSI & Mellanox & NYU & UW
IRN (Improved RoCE NIC): handles packet loss better, eliminating the need for PFC.
RDMA over Commodity Ethernet at Scale (SIGCOMM 2016)
Microsoft
Challenges of running RoCEv2 at scale; proposes a DSCP (Differentiated Services Code Point) based PFC mechanism.
Congestion Control for Large-Scale RDMA Deployments (SIGCOMM 2015)
Microsoft & Mellanox & UCSB
DCQCN: a congestion control scheme for RoCEv2 that alleviates the problems caused by PFC.
Fast Distributed Deep Learning over RDMA (EuroSys 2019)
MSRA
Towards Zero Copy Dataflows using RDMA (SIGCOMM 2017 Posters and Demos)
HKUST
Merged into TensorFlow.
Empowering Azure Storage with RDMA
Microsoft
Production experience in Microsoft Azure
Around 70% of traffic in Azure is RDMA.
When Cloud Storage Meets RDMA
NJU & Alibaba
Pangu
Production experience in Alibaba Cloud
Two workarounds to handle PFC storms: shutdown, RDMA/TCP switching.
Understanding RDMA Microarchitecture Resources for Performance Isolation
Duke & Microsoft & SJTU
Develop a test suite to evaluate RDMA performance isolation solutions.
PFC: Priority Flow Control
RoCE: RDMA over Converged Ethernet
IBoE: InfiniBand over Ethernet