【RDMA】优化 RDMA 代码的提示和技巧

最新推荐文章于 2023-07-17 22:49:46 发布

bdview

最新推荐文章于 2023-07-17 22:49:46 发布

阅读量691

点赞数

文章标签： java

本文链接：https://blog.csdn.net/weixin_42319496/article/details/121122727

版权

本文提供了一系列关于如何优化RDMA代码的提示和技巧，包括避免在数据路径中使用控制操作，使用事件批量处理工作完成，减少scatter/gather条目，以及优化带宽、降低延迟、减少内存和CPU消耗的方法。通过这些优化，可以充分利用RDMA的高性能特性。

摘要由CSDN通过智能技术生成

RDMA is used in many places, mainly because of the high performance that it allows to achieve. In this post, I will provide tips and tricks on how to optimize RDMA code in several aspects.

General tips

Avoid using control operations in the data path

Unlike the data operations that stay in the same context that they were called in (i.e. don't perform a context switch) and they are written in optimized way, the control operations (all create/destroy/query/modify) operations are very expensive because:

Most of the time, they perform a context switch
Sometimes they allocate or free dynamic memory
Sometimes they involved in accessing the RDMA device

As a general rule of thumb, one should avoid calling control operations or decrease its use in the data path.

The following verbs are considered as data operations:

ibv_post_send()
ibv_post_recv()
ibv_post_srq_recv()
ibv_poll_cq()
ibv_req_notify_cq

When posting multiple WRs, post them in a list in one call

When posting several Work Requests to one of the ibv_post_*() verbs, posting multiple Work Requests as a linked list in one call instead of several calls each time with one Work Request will provide better performance since it allows the low-level driver to perform optimizations.

When using Work Completion events, acknowledge several events in one call

When handling Work Completions using events, acknowledging several completions in one call instead of several calls each time will provide better performance since less mutual exclusion locks are being performed.