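Rolling-Unet: Revitalizing MLP's Ability to Efficiently Extract Long-Distance Dependencies for Medical Image Segmentation (from zero to one: understanding Rolling-Unet through a mind map)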

Latest recommended article published on 2024-11-10 11:53:38

秌枫



Tags: artificial intelligence, deep learning, computer vision, convolutional neural networks

Article link: https://blog.csdn.net/2303_79426621/article/details/140824745


Paper: Rolling-Unet: Revitalizing MLP's Ability to Efficiently Extract Long-Distance Dependencies for Medical Image Segmentation (Proceedings of the AAAI Conference on Artificial Intelligence): https://ojs.aaai.org/index.php/AAAI/article/view/28173

Detailed Rolling-Unet mind map:

https://kdocs.cn/l/cjWmQTVaTzws

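I studied Rolling-Unet because I had to present U-Net variants at a group meeting. This article tries to summarize the Rolling-Unet model and present it as clearly as I can. This is my first time working on the topic, so if anything is wrong, corrections are very welcome.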

Sections 1 to 5: a walkthrough of the original paper (explained directly with the slides I prepared for the group meeting).

Section 6: my own reading and presentation of the paper (if the original is hard to follow, you can go straight to this section).

The Rolling-Unet mind map is in Section 6.2.

3.2 OR-MLP and DOR-MLP

3.3 Lo2 Block and Feature Incentive Block

4: Experiments

5: Conclusion

6: My understanding of Rolling-Unet

6.1 Strategies Rolling-Unet adopts to improve the network structure

6.2 Using the Lo2 Block to replace these convolutional layers

1: Abstract

Medical image segmentation methods based on deep learning network are mainly divided into CNN and Transformer. However, CNN struggles to capture long-distance dependencies, while Transformer suffers from high computational complexity and poor local feature learning. To efficiently extract and fuse local features and long-range dependencies, this paper proposes Rolling-Unet, which is a CNN model combined with MLP. Specifically, we propose the core R-MLP module, which is responsible for learning the long-distance dependency in a single direction of the whole image. By controlling and combining R-MLP modules in different directions, OR-MLP and DOR-MLP modules are formed to capture long-distance dependencies in multiple directions. Further, Lo2 block is proposed to encode both local context information and long-distance dependencies without excessive computational burden. Lo2 block has the same parameter size and computational complexity as a 3×3 convolution. The experimental results on four public datasets show that Rolling-Unet achieves superior performance compared to the state-of-the-art methods.

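In short: deep-learning methods for medical image segmentation are mainly CNNs or Transformers. CNNs have difficulty capturing long-distance dependencies, while Transformers are computationally expensive and weak at learning local features. Rolling-Unet is a CNN combined with MLP that aims to extract and fuse both kinds of information. Its core R-MLP module learns the long-distance dependency along a single direction across the whole image. Controlling and combining R-MLP modules in different directions gives the OR-MLP and DOR-MLP modules, which capture long-distance dependencies in multiple directions. The Lo2 block encodes local context and long-distance dependencies together without excessive computational burden, with the same parameter size and computational complexity as a 3×3 convolution. On four public datasets, Rolling-Unet outperforms current methods.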
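To make "long-distance dependency in a single direction" concrete, here is a minimal pure-Python sketch of one way an R-MLP-style step can work. It assumes (the abstract does not spell this out) that each channel is circularly shifted along the width by an amount equal to its channel index, and that a shared linear projection then mixes the channels at every pixel. The function names `roll_row`, `rolling_width`, `channel_mix`, and `r_mlp_width` are hypothetical and chosen only for this illustration.

```python
def roll_row(row, shift):
    """Circularly shift a list to the right by `shift` positions."""
    shift %= len(row)
    return row[-shift:] + row[:-shift] if shift else list(row)


def rolling_width(x, step=1):
    """Shift channel c of x (nested lists, shape [C][H][W]) by c * step along the width.

    Assumption for illustration: the shift grows linearly with the channel index.
    """
    return [[roll_row(row, c * step) for row in channel]
            for c, channel in enumerate(x)]


def channel_mix(x, weight):
    """Apply the same linear projection (weight is [C_out][C_in]) at every pixel."""
    c_in, height, width = len(x), len(x[0]), len(x[0][0])
    return [[[sum(weight[o][c] * x[c][h][w] for c in range(c_in))
              for w in range(width)]
             for h in range(height)]
            for o in range(len(weight))]


def r_mlp_width(x, weight, step=1):
    """Roll along the width, then mix channels.

    After the roll, the channel vector at one pixel holds values that came from
    different width positions, so the per-pixel projection sees a whole row.
    """
    return channel_mix(rolling_width(x, step), weight)


if __name__ == "__main__":
    # 4 channels, 1 row, 4 columns; the value 10 * c + w encodes (channel, column).
    x = [[[10 * c + w for w in range(4)]] for c in range(4)]
    print(rolling_width(x))
    print(r_mlp_width(x, [[1, 1, 1, 1]]))
```

Printing `rolling_width(x)` shows that, at any fixed column, the four channels now carry values that originated in four different columns. That is why a projection that only mixes channels can still gather information from across the row. In a tensor library the same idea would typically be a roll operation followed by a 1×1 convolution.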

2: Main contributions of the paper

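(1) A new method for capturing long-distance dependencies is proposed, and the R-MLP module is built on it.

(2) Building on (1), the OR-MLP and DOR-MLP modules are constructed, which capture long-distance dependencies in more directions.

(3) Building on (2), the Lo2 block is proposed. It extracts local context information and long-distance dependencies at the same time without adding computational burden. The Lo2 block has the same level of parameters and computation as a 3×3 convolution.

(4) Building on (3), Rolling-Unet networks of different parameter scales are constructed. On four datasets, every scale of Rolling-Unet outperforms existing methods, which supports the effectiveness of the approach.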
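Contribution (3) uses a 3×3 convolution as its yardstick, so it helps to know how large that budget is. The sketch below only computes the parameter count and multiply-accumulate count of a standard 3×3 convolution (stride 1, "same" padding). It does not model the internals of the Lo2 block, which this section does not specify. The function names are hypothetical.

```python
def conv3x3_params(c_in, c_out, bias=True):
    """Parameters of a standard 3x3 convolution: 9 weights per (input, output) channel pair."""
    return 3 * 3 * c_in * c_out + (c_out if bias else 0)


def conv3x3_macs(c_in, c_out, height, width):
    """Multiply-accumulates for stride 1 and 'same' padding: one 3x3xC_in dot product
    per output channel at every output pixel."""
    return 3 * 3 * c_in * c_out * height * width


if __name__ == "__main__":
    # Example layer size chosen only for illustration.
    print(conv3x3_params(64, 64))
    print(conv3x3_macs(64, 64, 32, 32))
```

Any block that claims parity with a 3×3 convolution at the same channel width and resolution should land at roughly these figures.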